Computer Science, 2013, Vol. 40, Issue (Z6): 302-306.


Fault Location Technology Based on the Distributed Event Processing System

DU Cui-lan, TAN Jian-long, WANG Xiao-yan, ZHANG Yu, LIU Ping and FAN Dong-jin

  • Online:2018-11-16 Published:2018-11-16

Abstract: In recent years, distributed computing systems have become larger and more complex to control. System faults are growing exponentially and cause serious harm and loss, and troubleshooting and fault location are becoming increasingly difficult. Traditional methods judge whether a program is running correctly by tracing it; they consume excessive resources of the target program and are intrusive to the exchange of distributed monitoring information, so they can hardly meet the demands of software behavior analysis. Finding and locating faults in time through complex event processing is especially urgent in distributed monitoring environments, where events occur in large volumes, rapidly, and without interruption. Complex event processing uses events that carry meaningful state-change information to analyze system behavior, judge the system's operating condition, and detect and locate faults, thereby ensuring healthy operation. Existing complex event description languages describe complex events with SQL-based methods; such data stream query languages are complex for ordinary users and difficult to master. By constructing a set-based event flow model, complex events can be formally defined in terms of sets of events. The user only needs to master a few simple set operations to define complex fault rules.

Key words: Distributed network, Real-time monitoring system, Fault location
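
The abstract's idea of defining fault rules with a few set operations can be illustrated with a minimal sketch. This is not the authors' model or rule language: the `Event` fields, the event kinds ("timeout", "disk_error", "recovered"), the helper `nodes_with`, and the rule in `locate_faults` are all hypothetical, chosen only to show how intersection and difference over event sets could express a composite fault condition.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """A monitoring event (hypothetical fields, not taken from the paper)."""
    node: str    # node that emitted the event
    kind: str    # event type, e.g. "timeout"
    time: float  # timestamp in seconds


def nodes_with(events, kind, start, end):
    """Return the set of nodes that emitted `kind` within [start, end]."""
    return {e.node for e in events if e.kind == kind and start <= e.time <= end}


def locate_faults(events, start, end):
    """Hypothetical rule: a node is faulty if it reported both a timeout
    and a disk error in the window and did not report a recovery."""
    timed_out = nodes_with(events, "timeout", start, end)
    disk_err = nodes_with(events, "disk_error", start, end)
    recovered = nodes_with(events, "recovered", start, end)
    # Intersection selects nodes showing both symptoms;
    # difference drops nodes that have since recovered.
    return (timed_out & disk_err) - recovered


events = {
    Event("node-a", "timeout", 1.0),
    Event("node-a", "disk_error", 2.0),
    Event("node-b", "timeout", 1.5),
    Event("node-c", "timeout", 2.5),
    Event("node-c", "disk_error", 3.0),
    Event("node-c", "recovered", 4.0),
}

print(sorted(locate_faults(events, 0.0, 5.0)))  # ['node-a']
```

Under these assumptions the rule is a single expression over three sets, which is the kind of simplicity the abstract contrasts with SQL-style stream queries.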

