计算机科学 ›› 2014, Vol. 41 ›› Issue (Z11): 382-386.

• 软件工程与数据库技术 • 上一篇    下一篇

程序控制流检测算法优化

李剑明,谭庆平,徐建军,尹胜   

  1. 75130部队133分队 贵港537103;国防科技大学计算机学院 长沙410073;国防科技大学计算机学院 长沙410073;75130部队政治部 贵港537103
  • 出版日期:2018-11-14 发布日期:2018-11-14

Optimization of Control Flow Checking Algorithm

LI Jian-ming,TAN Qing-ping,XU Jian-jun and YIN Sheng   

  • Online:2018-11-14 Published:2018-11-14

摘要: 暴露在太空辐射环境下的星载计算机,其电子元器件可能因受到高能带电粒子的轰击而造成硬件系统的瞬时故障,所以,需要使用容错技术来提高其可靠性。对一种面向硬件瞬时故障的纯软件控制流检测算法RSCFC(Relationship Signatures for Control Flow Checking)进行了有效的改进,通过对标签S进行分段编码,克服了原算法中存在的待加固程序的基本块总数受机器字长限制的问题,并给出了具体的计算证明。计算结果表明,如果机器字长为64位,那么改进后的算法在设定前提下能单层加固程序的最大基本块数可以超过218。与RSCFC相比,改进后的算法在加固基本块总数比较大的程序时,性能开销明显降低而且检错能力保持不变。

关键词: 瞬时故障,数字标签,控制流检测,软件容错

Abstract: In the space environment,the electrical circuses of computer are often subject to hardware transient faults,which are caused by high energy neutrons from cosmic rays.It is necessary to utilize appropriate fault tolerance techniques for improving the reliability of space application.This paper proposed an enhanced algorithm based on RSCFC(Relationship Signatures for Control Flow Checking),which is a control flow checking approach for hardware transient faults.In RSCFC,the sum of basic blocks is limited by the length of machine word.Through the segmented encoding of signatures,the optimized method solves the problem effectively.The analytical results indicate that the maximal number of basic blocks is extended to 218 when the length of machine word is 64 bits.Compared with RSCFC,the overhead of performance and memory is decreased evidently in our algorithm,and the faults detecting capability remains.

Key words: Transient faults,Software signatures,Control flow checking,Software tolerance

[1] Shivakumar P,Kistler M,Keckler S W,et al.Modeling the effect of technology trends on the soft error rate of combinational logic[C]∥Bethesdaed.Proceedings of the 2002 International Confe-rence on Dependable Systems and Networks(DSN 2002).Wa-shington:D.C,2002:389-399
[2] Siewiorek D P,Swarz R S.The Theory and Practice of ReliableSystem Design[M].Digital Press,1982
[3] lyer R K,Rossetti D J.A measurement—based model for workload dependence of CPU errors[J].IEEE Transactions on Computer,1986,35(6):511-519
[4] Bagchi S,Kalbarczyk Z,lyer R,et al.Hierarchical error detection in a SIFT environment[D].U of nlinois,2001
[5] Lyons R E,Vanderkulk W.The Use of Triple-Modular Redundancy to Improve Computer Reliability[J].IBM Journal of Research and Development,1962,6(2):200-209
[6] Pradhan D K.Fault-Tolerant Computer System Design[M].Prentice Hall,1996
[7] Lu D J.Watchdog Processor and Structural Integrity Checking[J].IEEE Transactions on Computers,1982,1(7):681-685
[8] Mahmood A,McCluskey E J.Concurrent error detection using watchdog processors-a survey[J].IEEE Transactions on Computers,1988,7(2):160-174
[9] Aho A,Sethi R,Ullman J.Compilers:Principles,Techniques and Tools(2nd)[M].Reading MA:Addison-Wesley,1986
[10] Oh N,Shirvani P P,McCluskey E J.Control-Flow Checking by Software Signatures[J].IEEE Transactions on Reliability,2002,51:111-122
[11] Reis G A,Chang J,Vachharajani N,et al.SWIFT:Software implemented fault tolerance.http://liberty.princeton.edu/publications.cg03-swift.pdf
[12] Nicolescu B,Savaria Y,Velazco R.Software Detection Mechanisms Providing Full Coverage Against Single Bit-Flip Faults[J].IEEE Transactions on Nuclear Science,2004,51(6):3510-3518
[13] Li Ai-guo,Hong Bing-rong.Software implemented transientfault detection in space computer[J].Aerospace Science and Technology,2007,11(2/3):245-252

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!