计算机科学 ›› 2018, Vol. 45 ›› Issue (6A): 508-512.

• 综合、交叉与应用 • 上一篇    下一篇

改进的并行fp-growth算法在工业设备故障诊断中的应用研究

张斌1,滕俊杰1,满毅2   

  1. 桂林电子科技大学计算机与信息安全学院 广西 桂林5410041
    北京邮电大学电子工程学院 北京1008762
  • 出版日期:2018-06-20 发布日期:2018-08-03
  • 作者简介:张 斌(1970-),男,硕士,副教授,主要研究方向为计算机网络与应用技术、计算机控制技术、网络安全;滕俊杰(1990-),男,硕士生,主要研究方向为大数据挖掘,E-mail:505091074@qq.com(通信作者);满 毅(1975-),男,博士,副教授,主要研究方向为大数据分析与挖掘、网管等。
  • 基金资助:
    国家自然科学基金(61762028),桂林市科学研究与技术开发计划(20160218-1)资助

Application Research of Improved Parallel Fp-growth Algorithm in Fault Diagnosis
of Industrial Equipment

ZHANG Bin1,TENG Jun-jie1,MAN Yi2   

  1. School of Computer and Information Security,Guilin University of Electronic Technology,Guilin,Guangxi 541004,China1
    School of Electronic Engineering,Beijing University of Posts and Telecommunications,Beijing 100876,China2
  • Online:2018-06-20 Published:2018-08-03

摘要: 如今,工业设备不断向智能化、大型化发展,伴随着设备故障日益复杂多样,如何快速、准确地诊断故障成为一个难题。通过研究,提出以大数据技术Hadoop为平台,基于兴趣属性列的改进的fp-growth算法作为数据挖掘方法,来实现工业设备的故障诊断。实验以工业齿轮箱为例,首先选取两部分数据分别作为训练数据和测试数据,在预处理阶段对训练数据进行空值处理、维度相关性分析以及抽样离散化数据;其次提出基于兴趣属性列的改进的并行fp-growth算法,从训练数据中挖掘出属性列与故障之间的关联规则;最后通过测试数据验证关联规则,证明了改进方法的可行性。实验结果表明,基于兴趣属性列改进的并行fp-growth算法能够在保证准确率的情况下进行快速故障诊断。

关键词: Fp-growth, Hadoop, 故障诊断

Abstract: Nowadays,industrial equipment is becoming more and more intelligent and large-scale.Along with the increasingly complex and diverse equipment failures,how to diagnose faults quickly and accurately has become a challenge.Hence,taking big data technology Hadoop as platform,fp-growth is utilized as big data mining method to realize the fault diagnosis of industrial equipment.Taking the industrial gear box as example,firstly,the two parts of data are selected as the training data and the test data respectively.In preprocessing stage,the training data is processed by null value,the correlation analysis of dimension and discretization of data.Secondly,this paper put forward an improved paral-lel fp-growth algorithm based on interest to mine the association rules between attribute columns and faults by the training data.Finally,the association rules were verified by the test data to prove the feasibility of the improved method.Experiment results show that the proposed interest based improved parallel fp-growth algorithm can perform fault diagnosis efficiently with accuracy.

Key words: Fault diagnosis, Fp-growth, Hadoop

中图分类号: 

  • TP277
[1]WU L,YAO B,PENG Z,et al.Fault Diagnosis of Roller Bea- rings Based on a Wavelet Neural Network and Manifold Lear-ning[J].Applied Sciences,2017,7(2):158.
[2]CERRADA M,ZURITA G,CABRERA D,et al.Fault diagnosis in spur gears based on genetic algorithm and random forest[J].Mechanical Systems & Signal Processing,2016,70-71:87-103.
[3]胡辉.基于Hadoop的动车组故障数据关联规则挖掘研究与实现[D].北京:北京交通大学,2015.
[4]WHITE T,CUTTING D.Hadoop:the definitive guide[J].O’reilly Media IncGravenstein Highway North,2009,215(11):1-4.
[5]SHVACHKO K,KUANG H,RADIA S,et al.The Hadoop Distributed File System[C]∥MASS Storage Systems and Techno-logies.IEEE,2010:1-10.
[6]DEAN J,GHEMAWAT S.MapReduce:A Flexible Data Processing Tool[J].Communications of the Acm,2010,53(1):72-77.
[7]AGRAWAL R,SHAFER J C.Parallel Mining of Association Rules[J].Journal of Supercomputing,2007,39(3):273-299.
[8]LEI Y,JIA F,LIN J,et al.An Intelligent Fault Diagnosis Me- thod Using Unsupervised Feature Learning Towards Mechanical Big Data[J].IEEE Transactions on Industrial Electronics,2016,63(5):3137-3147.
[9]WANG B,CHEN D,SHI B,et al.Comprehensive Association Rules Mining of Health Examination Data with an Extended FP-Growth Method[J].Mobile Networks & Applications,2017,22(2):1-8.
[10]刘鑫,贾云献,孙磊,等.基于BP神经网络的变速箱故障诊断方法研究[J].计算机测量与控制,2017,25(1):12-15.
[11]谢宏,程浩忠,牛东晓.基于信息熵的粗糙集连续属性离散化算法[J].计算机学报,2005,28(9):1570-1574.
[12]杨世海,李涛,陈铭明,等.基于数据挖掘的智能电网在线故障诊断与分析[J].电子设计工程,2017,25(1):136-139.
[13]LIU D,LIN Y,HUANG P C,et al.Durable and Energy Efficient In-Memory Frequent Pattern Mining[J].IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems,2017,PP(99):1-1.
[14]ZENG Y,YIN S,LIU J,et al.Research of improved FP-Growth algorithm in association rules mining[J].Scientific Programming,2015,2015:6.
[15]杨鹏坤,彭慧,周晓锋,等.改进的基于频繁模式树的最大频繁项集挖掘算法——FP-MFIA[J].计算机应用,2015,35(3):775-778.
[1] 曹扬晨, 朱国胜, 孙文和, 吴善超.
未知网络攻击识别关键技术研究
Study on Key Technologies of Unknown Network Attack Identification
计算机科学, 2022, 49(6A): 581-587. https://doi.org/10.11896/jsjkx.210400044
[2] 田冰川, 田臣, 周宇航, 陈贵海, 窦万春.
减少Hadoop集群中网络队头阻塞的调度算法
Reducing Head-of-Line Blocking on Network in Hadoop Clusters
计算机科学, 2022, 49(3): 11-22. https://doi.org/10.11896/jsjkx.210900117
[3] 白勇, 张占龙, 熊隽迪.
基于FP-Growth算法和GRNN的电力知识文本挖掘
Power Knowledge Text Mining Based on FP-Growth Algorithm and GRNN
计算机科学, 2021, 48(8): 86-90. https://doi.org/10.11896/jsjkx.210600031
[4] 雷剑梅, 曾令秋, 牟洁, 陈立东, 王淙, 柴勇.
基于整车EMC标准测试和机器学习的反向诊断方法
Reverse Diagnostic Method Based on Vehicle EMC Standard Test and Machine Learning
计算机科学, 2021, 48(6): 190-195. https://doi.org/10.11896/jsjkx.200700204
[5] 王焘, 张树东, 李安, 邵亚茹, 张文博.
一种面向异常传播的微服务故障诊断方法
Anomaly Propagation Based Fault Diagnosis for Microservices
计算机科学, 2021, 48(12): 8-16. https://doi.org/10.11896/jsjkx.210100149
[6] 王青松, 姜富山, 李菲.
大数据环境下基于关联规则的多标签学习算法
Multi-label Learning Algorithm Based on Association Rules in Big Data Environment
计算机科学, 2020, 47(5): 90-95. https://doi.org/10.11896/jsjkx.190300150
[7] 朱晓玲, 李琨, 张长胜, 杜付鑫.
基于Gabor小波变换和多核支持向量机的电梯导靴故障诊断方法
Elevator Boot Fault Diagnosis Method Based on Gabor Wavelet Transform and Multi-coreSupport Vector Machine
计算机科学, 2020, 47(12): 258-261. https://doi.org/10.11896/jsjkx.200700039
[8] 林毅, 吉鸿江, 韩佳佳, 张德平.
一种基于马氏距离的系统故障诊断方法
System Fault Diagnosis Method Based on Mahalanobis Distance Metric
计算机科学, 2020, 47(11A): 57-63. https://doi.org/10.11896/jsjkx.190900174
[9] 贾宁, 李瑛达.
基于智能可穿戴设备的个性化健康监管平台的构建
Construction of Personalized Health Monitoring Platform Based on Intelligent Wearable Device
计算机科学, 2019, 46(6A): 566-570.
[10] 郭杨, 梁家荣, 刘峰, 谢敏.
一种基于超立方体网络的高效故障诊断并行算法
Novel Fault Diagnosis Parallel Algorithm for Hypercube Networks
计算机科学, 2019, 46(5): 73-76. https://doi.org/10.11896/j.issn.1002-137X.2019.05.011
[11] 王岩, 罗倩, 邓辉.
基于变分贝叶斯的轴承故障诊断方法
Bearing Fault Diagnosis Method Based on Variational Bayes
计算机科学, 2019, 46(11): 323-327. https://doi.org/10.11896/jsjkx.180901719
[12] 张刚, 高俊鹏, 李红威.
级联三稳态随机共振的特性研究及应用
Research on Stochastic Resonance Characteristics of Cascaded Three-steady-state and Its Application
计算机科学, 2018, 45(9): 146-151. https://doi.org/10.11896/j.issn.1002-137X.2018.09.023
[13] 薛善良,杨佩茹,周奚.
基于模糊神经网络的WSN无线数据收发单元故障诊断
WSN Wireless Data Transceiver Unit Fault Diagnosis with Fuzzy Neural Network
计算机科学, 2018, 45(5): 38-43. https://doi.org/10.11896/j.issn.1002-137X.2018.05.006
[14] 应毅, 任凯, 刘亚军.
基于大数据的网络日志分析技术
Network Log Analysis Technology Based on Big Data
计算机科学, 2018, 45(11A): 353-355.
[15] 丁勇, 朱长水, 武玉艳.
一种基于Hadoop的关联规则挖掘算法
Association Rule Mining Algorithm Based on Hadoop
计算机科学, 2018, 45(11A): 409-411.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!