计算机科学 ›› 2010, Vol. 37 ›› Issue (12): 35-40.

• 计算机网络与信息安全 • 上一篇    下一篇

基于机器学习的IP流量分类研究

刘琼,刘珍,黄敏   

  1. (华南理工大学软件学院,广州510006)
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受华南理工大学985科研启动经费(D606001II)资助。

Study on Internet Traffic Classification Using Machine Learning

LIU Qiong,LIU Zhen,HUANU Min   

  • Online:2018-12-01 Published:2018-12-01

摘要: IP流量分类是Internet研究和流量工程的重要基础,近年来网络应用类别和Internet流数量在快速增长。流量分类技术不断面临新的挑战。对基于机器学习的IP流量分类方法进行了系统性研究。给出了这类流量分类方法的数学描述;通过深入研究有监督和无监督机器学习方法在流量分类中的应用,从数据预处理、模型构建和模型评估3个方面评述这类技术的研究现状,并指出存在的问题;总结得出现阶段基于机器学习的IP流量分类技术存在数据偏斜、标识瓶颈、属性变化和实时分类等4个方面的共性问题;最后展望了流量分类技术的未来发展方向并介绍了作者正在进行的工作。

关键词: 流量分类,机器学习,网络流,网络测量

Abstract: Internet traffic classification is one of the key foundations for research works and traffic engineering in Internet. The categories of Internet applications and the number of Internet flows increase fast last years. The technique challenges arc coped with development of traffic classification all the time. A research was carried out systematically in Internet traffic classification using machine learning in the paper. A mathematical description was given for the technology of traffic classification first. After surveying the technology of traffic classification based on supervised learning and unsupervised learning, we commented on the data preprocessing, classification model and performance evaluation etc. three aspects, and indicated the shortages now in these studies. Then, four common issues were summarized, which are data skew, label bottleneck, attribute changing and classify timely respectively. At the end, the prospect and our going work in this area were pointed out.

Key words: Traffic classification, Machine learning, Traffic flow, Network measurements

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!