Computer Science ›› 2016, Vol. 43 ›› Issue (8): 1-6. doi: 10.11896/j.issn.1002-137X.2016.08.001


Review and Prospect of Server Monitoring Technology

WANG Hui-qiang, DAI Xiu-hao, LV Hong-wu and LIN Jun-yu

  1. Harbin Engineering University, Harbin 150001
  • Online: 2018-12-01   Published: 2018-12-01
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61370212, 61402127), the Specialized Research Fund for the Doctoral Program of Higher Education, Priority Development Areas (20122304130002), the Natural Science Foundation of Heilongjiang Province (ZD201102, F2015029), and the Fundamental Research Funds for the Central Universities (HEUCF100601).

Abstract: As servers are deployed across more and more fields and server installations continue to grow in scale, server monitoring technology plays a key role in keeping servers working effectively over the long term. This paper first sets out the requirements of server monitoring. It then summarizes the key technologies involved, including the heartbeat mechanism, the Intelligent Platform Management Interface (IPMI), the Simple Network Management Protocol (SNMP), and virtualization technology. On this basis, it describes mainstream domestic and international monitoring products, such as IBM Tivoli and HP OpenView, and compares and analyzes their functions. Finally, it looks ahead to the development of monitoring technology and, in response to the high-availability requirements of current servers, proposes a monitoring framework for high-availability servers.

Key words: Server monitoring, Heartbeat mechanism, Intelligent Platform Management Interface (IPMI), Simple Network Management Protocol (SNMP), High availability

