计算机科学 ›› 2012, Vol. 39 ›› Issue (Z11): 200-203.

• 软件工程 • 上一篇    下一篇

基于m-Markov模型的交叉用户会话识别

黄 浩,李 兵,姜 丹   

  1. (对外经济贸易大学信息学院 北京100029)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Identifying Interleaved User Sessions Based on m-Markov Model

  • Online:2018-11-16 Published:2018-11-16

摘要: Web访问日志数据是由单个用户点击形成的数据集,各点击是独立的,会话识别的任务就是将各个独立的点击划分成有意义的会话片段。一般的会话识别算法无法对包含交又会话数据的Web访问日志数据成功地进行会话识别,利用自适应m-Markov模型能对Web访问日志数据进行交又服务器会话识别和重构,并在m-Markov模型的基础上结合不同的会话结束判断准则进行会话识别准确率的比较。实验结果显示,将m-Markov模型与基于奖惩策略的会话结束算法结合能明显提高会话识别和重构的准确率。

关键词: Web使用挖掘,Markov模型,会话重构

Abstract: The data of Web log is record collection of users' access request independently. The task of identifying sessions is to partition the independent access request to session segments. But the general reconstructed session algorithms can't identify the sessions from the Web log which includes many interleaved sessions data. This paper presents an approach for interleaved server session from Web server logs using self-adaptive m-order Markov model combined with server different principles which detect the end of sessions. The proposed approach has the ability to reconstruct interlcaved sessions from server logs. The experiments show that combining rnMarkov model with rewords-punishment policy will achieve a significant improvement in regarding interleaved sessions compared to the traditional methods.

Key words: Web usage mining,Markov model,Session reconstruction

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!