论文部分内容阅读
为了更加合理地组织Web服务器的结构,需要通过Web日志挖掘分析用户的浏览模式,而Web日志挖掘中的数据预处理工作关系到挖掘的质量。文章就此进行了深入的研究,提出一个包括数据净化、用户识别、会话识别和路径补充等过程的数据预处理模型,并通过一个实例具体介绍了各过程的主要任务。
In order to organize the structure of Web server more reasonably, it is necessary to analyze the user’s browsing mode through Web log mining. The data preprocessing in Web log mining is related to the quality of mining. This article has carried on the thorough research to this end, proposed a data preprocessing model including data purification, user identification, conversation recognition and route supplementing and so on. And through an example introduced the main task of each course concretely.