计算机科学 ›› 2021, Vol. 48 ›› Issue (6): 246-252.doi: 10.11896/jsjkx.201200142

• 计算机网络 • 上一篇    下一篇

基于随机森林的高性能互连网络阻塞故障检测

徐佳庆, 胡小月, 唐付桥, 王强, 何杰   

  1. 国防科技大学计算机学院 长沙410073
  • 收稿日期:2020-12-15 修回日期:2021-03-21 出版日期:2021-06-15 发布日期:2021-06-03
  • 通讯作者: 徐佳庆(xujiaqing@nudt.edu.cn)
  • 基金资助:
    国家重点研发计划(2018YFB0204300);国防科技重点实验室基金(6142110180101)

Detecting Blocking Failure in High Performance Interconnection Networks Based on Random Forest

XU Jia-qing, HU Xiao-yue, TANG Fu-qiao, WANG Qiang, HE Jie   

  1. School of Computer,National University of Defense Technology,Changsha 410073,China
  • Received:2020-12-15 Revised:2021-03-21 Online:2021-06-15 Published:2021-06-03
  • About author:XU Jia-qing,born in 1982,Ph.D,research assistant,is a member of China Computer Federation.His main research interests include high-speed Interconnection Networks and AIOPs.
  • Supported by:
    National Key Research and Development Program of China(2018YFB0204300) and Foundation of National Defense Technology Key Laboratory(6142110180101).

摘要: 高性能互连网络是高性能计算机系统中各节点高速协同并行计算的关键。在高性能互连网络的运维过程中,由链路质量恶化引发的网络端口阻塞故障定位困难,一旦发生网络端口阻塞,轻则会导致网络中的丢包率和端对端延迟升高,重则会造成整个网络的瘫痪,严重影响整个系统的可靠性。随着人工智能时代的到来,智能运维已经在网络运维中发挥了重要作用,但是基于高性能互连网络的智能运维研究相对较少。文中基于运维人员在自研高速互连网络运维中积累的大量数据和丰富经验,提出使用有监督的随机森林方法进行网络阻塞检测,实验结果表明,该方法在保持平均95%的召回率的前提下,平均准确率为93.7%,能够有效地解决网络阻塞的检测问题。

关键词: 故障检测, 互连网络, 随机森林

Abstract: High performance interconnection network is the key to high speed collaborative parallel of all nodes in high perfor-mance computer system.During the operation and maintenance of high performance interconnection networks,it is found that the network port blocking fault caused by the deterioration of link quality is difficult to locate and greatly affects the availability of the whole system.With the advent of the era of artificial intelligence,intelligent operation and maintenance has played an important role in network operation and maintenance.However,research on intelligent operation and maintenance based on high performance interconnection network is relatively few.This paper is based on a large amount of data and rich experience accumulated by operations staff in the self-development of high speed interconnection network operation and maintenance.It proposes a supervised random forest method for network blocking detection.The experimental results show that the proposed method has an ave-rage accuracy of 93.7% while maintaining an average recall rate of 95%,and can effectively solve the problem of network blocking detection.

Key words: Failure detection, Interconnection networks, Random forest

中图分类号: 

  • TP391
[1]LAPTEV N,AMIZADEH S,FLINT I.Generic and scalableframework forautomated time-series anomaly detection[C]//Proceedings of the 21th ACM SIGKDD International Confe-rence on Knowledge Discovery and Data Mining.2015:1939-1947.
[2]LIU D,ZHAO Y,XU H,et al.Opprentice:Towards practical and automatic anomaly detection through machine learning[C]//Proceedings of the 2015 Internet Measurement Confe-rence.2015:211-224.
[3]CHALAPATHY R,BORZESHI E Z,PICCARDI M.An investigation of recurrent neural architectures for drug name recognition[J].arXiv:1609.07585,2016.
[4]CHEN S,PENG M,XIONG H,et al.SVM intrusion detection model based on compressed sampling[J].Journal of Electrical and Computer Engineering,2016,2016(pt.1):12.
[5]JIANG J,JING X,LV B,et al.A Novel Multi-classification Intrusion Detection Model Based on Relevance Vector Machine[C]//2015 11th International Conference on Computational Intelligence and Security (CIS).IEEE,2015:303-307.
[6]HASSANZADEH R,NAYAK R.A semi-supervised graph-based algorithm for detecting outliers in online-social-networks[C]//Proceedings of the 28th Annual ACM Symposium on Applied Computing.2013:577-582.
[7]RUFF L,VANDERMEULEN R A,GÖRNITZ N,et al.Deep semi-supervised anomaly detection[J].arXiv:1906.02694,2019.
[8]PANG G,HENGEL A,SHEN C.Weakly-supervised DeepAnomaly Detection [J].arXiv:1910.13601v2,2020.
[9]AMER M,GOLDSTEIN M,ABDENNADHER S.Enhancingone-class support vectormachines for unsupervised anomaly detection[C]//Proceedings of the ACM SIGKDD Workshop on Outlier Detection and Description.2013:8-15.
[10]MÜNZ G,LI S,CARLE G.Traffic anomaly detection using k-means clustering[C]//GI/ITG-Workshop MMBnet 2007.
[11]STEFANIDIS K,VOYIATZIS A G.An HMM-based anomaly detection approach for SCADA systems[C]//IFIP International Conference on Information Security Theory and Practice.Cham:Springer,2016:85-99.
[12]ZHANG C,SONG D,CHEN Y,et al.A deep neural network for unsupervised anomaly detection and diagnosis in multivariatetime series data[C]//Proceedings of the AAAI Conference onArtificial Intelligence.2019,33:1409-1416.
[13]XU H,CHEN W,ZHAO N,et al.Unsupervised anomaly detection via variational auto-encoder for seasonal kpis in web applications[C]//Proceedings of the 2018 World Wide Web Confe-rence.2018:187-196.
[14]ZENATI H,FOO C S,LECOUAT B,et al.Efficient gan-based anomaly detection[J].arXiv:1802.06222,2018.
[15]CAO J J,XIAO L Q,WANG K F,et al.Implementation andevaluation of in-band management of supercomputing system interconnection network[J].Journal of Computers,2015,38:119.
[1] 高振卓, 王志海, 刘海洋.
嵌入典型时间序列特征的随机Shapelet森林算法
Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features
计算机科学, 2022, 49(7): 40-49. https://doi.org/10.11896/jsjkx.210700226
[2] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[3] 阙华坤, 冯小峰, 刘盼龙, 郭文翀, 李健, 曾伟良, 范竞敏.
Grassberger熵随机森林在窃电行为检测的应用
Application of Grassberger Entropy Random Forest to Power-stealing Behavior Detection
计算机科学, 2022, 49(6A): 790-794. https://doi.org/10.11896/jsjkx.210800032
[4] 王文强, 贾星星, 李朋.
自适应的集成定序算法
Adaptive Ensemble Ordering Algorithm
计算机科学, 2022, 49(6A): 242-246. https://doi.org/10.11896/jsjkx.210200108
[5] 章晓庆, 方建生, 肖尊杰, 陈浜, RisaHIGASHITA, 陈婉, 袁进, 刘江.
基于眼前节相干光断层扫描成像的核性白内障分类算法
Classification Algorithm of Nuclear Cataract Based on Anterior Segment Coherence Tomography Image
计算机科学, 2022, 49(3): 204-210. https://doi.org/10.11896/jsjkx.201100085
[6] 刘振宇, 宋晓莹.
一种可用于分类型属性数据的多变量回归森林
Multivariate Regression Forest for Categorical Attribute Data
计算机科学, 2022, 49(1): 108-114. https://doi.org/10.11896/jsjkx.201200189
[7] 杨小琴, 刘国军, 郭建慧, 马文涛.
基于随机森林的空域-频域联合特征全参考彩色图像质量评价方法
Full Reference Color Image Quality Assessment Method Based on Spatial and Frequency Domain Joint Features with Random Forest
计算机科学, 2021, 48(8): 99-105. https://doi.org/10.11896/jsjkx.200700106
[8] 郑建华, 李小敏, 刘双印, 李迪.
融合级联上采样与下采样的改进随机森林不平衡数据分类算法
Improved Random Forest Imbalance Data Classification Algorithm Combining Cascaded Up-sampling and Down-sampling
计算机科学, 2021, 48(7): 145-154. https://doi.org/10.11896/jsjkx.200800120
[9] 李娜娜, 王勇, 周林, 邹春明, 田英杰, 郭乃网.
基于特征重要度二次筛选的DDoS攻击随机森林检测方法
DDoS Attack Random Forest Detection Method Based on Secondary Screening of Feature Importance
计算机科学, 2021, 48(6A): 464-467. https://doi.org/10.11896/jsjkx.200900101
[10] 曹扬晨, 朱国胜, 祁小云, 邹洁.
基于随机森林的入侵检测分类研究
Research on Intrusion Detection Classification Based on Random Forest
计算机科学, 2021, 48(6A): 459-463. https://doi.org/10.11896/jsjkx.200600161
[11] 周益旻, 刘方正, 王勇.
基于混合方法的IPSec VPN加密流量识别
IPSec VPN Encrypted Traffic Identification Based on Hybrid Method
计算机科学, 2021, 48(4): 295-302. https://doi.org/10.11896/jsjkx.200700189
[12] 冯凯, 马鑫玉.
(n,k)-冒泡排序网络的子网络可靠性
Subnetwork Reliability of (n,k)-bubble-sort Networks
计算机科学, 2021, 48(4): 43-48. https://doi.org/10.11896/jsjkx.201100139
[13] 张天瑞, 魏铭琦, 高秀秀.
基于IPSO-WRF的选择性激光烧结件气泡溶解时间预测模型
Prediction Model of Bubble Dissolution Time in Selective Laser Sintering Based on IPSO-WRF
计算机科学, 2021, 48(11A): 638-643. https://doi.org/10.11896/jsjkx.210300080
[14] 刘振鹏, 苏楠, 秦益文, 卢家欢, 李小菲.
FS-CRF:基于特征切分与级联随机森林的异常点检测模型
FS-CRF:Outlier Detection Model Based on Feature Segmentation and Cascaded Random Forest
计算机科学, 2020, 47(8): 185-188. https://doi.org/10.11896/jsjkx.190600162
[15] 冯凯, 李婧.
k元n方体的子网络可靠性研究
Study on Subnetwork Reliability of k-ary n-cubes
计算机科学, 2020, 47(7): 31-36. https://doi.org/10.11896/jsjkx.190700170
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!