文章引用说明 更多>> (返回到该文章)

许晓东, 李柯, 朱士瑞 (2010) Web使用挖掘中Apriori算法的改进研究. 计算机工程与设计, 3, 539-541.

被以下文章引用:

  • 标题: Apriori算法在发现用户网页浏览模式上的应用Application of Apriori Algorithm in Finding User’s Webpage Browsing Mode

    作者: 魏林, 刘建毅, 王枞

    关键字: Web日志, Apriori算法, Web日志挖掘, 会话识别, k-项候选集Web Logs; Apriori Algorithm; Data Mining; Session Identification; k-Candidate Set

    期刊名称: 《Software Engineering and Applications》, Vol.2 No.6, 2013-12-13

    摘要: web服务器的日志文件记录了大量的用户网页访问信息,如何分析这些数据并从中发现用户的网页浏览模式比如用户感兴趣的页面、最佳的页面组合等从而为商家提供良好的决策支持变得越来越重要。本文用数据挖掘技术中的Apriori算法对记录用户页面访问信息的日志数据进行挖掘从而得到用户浏览网页的模式。本文首先对日志数据进行了预处理,从中提取了用户的一次会话中的页面访问记录,然后用Apriori算法对这些访问记录数据进行挖掘,同时针对这些待挖掘数据上的特点对挖掘算法Apriori在k-项候选集与事务的匹配上进行了改进,实验结果表明改进后的算法在处理数据量很大的数据时性能较传统算法有很好的提高。最后本文对挖掘后产生的规则进行了分析,发现了用户对本网站的一些网页的浏览模式,这些浏览模式为商家提供良好的决策支持。The log file of web server which recorded a large number of user’s visiting webpage information, and how to analyze these data and discover the user’s webpage browsing mode such as the webpages which users’ interested in browsing and the best page composition so as to provide a good decision support for merchants has become increasingly important. In this paper, Apriori algorithm was used to mine the log data of recording use’s accessing information for finding the regular pattern of user’s browsing the webpage. Firstly, this paper made data preprocessing to the log data for extracting one session access record of user. Secondly, the Apriori algorithm was used to mine these record data, considering the feature of these data, the paper made litter improvement for the algorithm at the matching of k-candidate set and the transaction. The experimental results showed that the performance of the improved algorithm in handling a large amount of data has a good improvement. Finally, this paper analysed the rules by excavating, and through these rules, some browsing modes were found, which provided decision supports for merchants.

在线客服:
对外合作:
联系方式:400-6379-560
投诉建议:feedback@hanspub.org
客服号

人工客服,优惠资讯,稿件咨询
公众号

科技前沿与学术知识分享