计算机科学 ›› 2011, Vol. 38 ›› Issue (1): 87-90.

• 计算机网络与信息安全 • 上一篇    下一篇

基于统计学习的挂马网页实时检测

王涛,余顺争   

  1. (广东工业大学自动化学院 广州510006);(中山大学信息科学与技术学院电子与通信工程系 广州510006)
  • 出版日期:2018-11-16 发布日期:2018-11-16
  • 基金资助:
    本文受国家高技术研究发展计划(863计划)专题课题( 2007AA01Z449 ),国家自然科学基金-广东联合基金重点项目(U0735002),国家自然科学基金面上项目(60970146),教育部博士点专项基金(20090171120001)资助。

Real-time Detection of Malicious Web Pages Based on Statistical Learning

WANG Tao,YU Shun-zheng   

  • Online:2018-11-16 Published:2018-11-16

摘要: 近年来挂马网页对Web安全造成严重威胁,客户端的主要防御手段包括反病毒软件与恶意站点黑名单。反病毒软件采用特征码匹配方法,无法有效检测经过加密与混淆变形的网页脚本代码;黑名单无法防御最新出现的恶意站点。提出一种新型的、与网页内容代码无关的挂马网页实时检测方法。该方法主要提取访问网页时HTTP会话过程的各种统计特征,利用决策树机器学习方法构建挂马网页分类模型并用于在线实时检测。实验证明,该方法能够达到89. 7%的挂马网页检测率与0. 3%的误检率。

关键词: 挂马网页,HTTP会话,决策树,机器学习

Abstract: Malicious Web pages impose increasing threats on Web security in recent years. Currently, there are mainly two client side protection approaches including anti-virus software packages and blacklists of malicious sites. Anti-virus techniques commonly use signaturcbased approaches which might not be able to efficiently identify malicious HTMI codes with encryption and obfuscation. Furthermore, blacklisting techniques are difficult to keep up-to-date. This paper presented a novel classification method for real-time detecting malicious Web pages which is independent with the contents of Web pages. Our approach characterizes malicious Web pages using HTTP session information. With representafive statistical features and decision tree algorithm in machine learning,we built an effective classification model for online real-time detecting malicious Web pages. Experiment results demonstrate that we arc able to successfully detect 89. 7% of the malicious Web pages with a low false positive rate of 0. 3%.

Key words: Malicious Web pages,HTTP session,Decision tree,Machine learning

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!