Computer Science ›› 2021, Vol. 48 ›› Issue (6A): 575-580.doi: 10.11896/jsjkx.200900133

• Interdiscipline & Application • Previous Articles     Next Articles

Research on Construction Method of Defect Prediction Dataset for Spacecraft Software

ZHENG Xiao-meng, GAO Meng, TENG Jun-yuan   

  1. Beijing Sunwise Information Technology Ltd.,Beijing 100190,China
    Beijing Institute of Control Engineering,Beijing 100190,China
  • Online:2021-06-10 Published:2021-06-17
  • About author:ZHENG Xiao-meng,born in 1986,postgraduate,engineer.Her main research interests include embedded software testing and software engineering.
  • Supported by:
    Equipment Pre-Research Field Fund Project(61400020407).

Abstract: As being the infrastructure of prediction model's construction and implementation,software defect prediction dataset faces two sets of problems.On the one hand,due to the difficulty of data collection from data sources,there are fewer available datasets.On the other hand,due to the difference of data in diverse fields and the inapplicability of software metrics standards,the published datasets are rarely applied in engineering.In this paper,combined with the real software testing data in the domestic space field,the method of spacecraft software metrics design and the construction process of spacecraft software defect prediction dataset are systematically expounded.According to the characteristics of the spacecraft software,a hybrid method combining the metrics based on code and quality of the software is proposed to ensure that the relevant characteristics of the spacecraft software can be described and measured comprehensively from different angles.At the same time,to solve the problem of high labor and storage cost for large-scale data collection,processing and analysis,a standardized dataset construction method combining the data cleaning process under version division and module hierarchical preprocessing is proposed.The dataset SPACE constructed based on this method is demonstrated,which proves that the method can be effectively applied to the construction of domain-specific high-quality software defect prediction dataset,and at the same time,good prediction effect of model AutoWeka can be obtained.

Key words: Data quality, Dataset, Software defect prediction, Software metrics, Spacecraft software

CLC Number: 

  • TP311
[1] HALL T,BEECHAM S,BOWES D,et al.A systematic literature review on fault prediction performance in software engineering[J].IEEE Trans.on Software Engineering,2012,38(6):1276-1304.
[2] CHEN X,GU Q,LIU W S,et al.Survey of static software defect prediction[J].Ruan Jian Xue Bao/Journal of Software,2016,27(1):1-25.
[3] WU F J.Research progress of static software defect prediction[J].Journal of Frontiers of Computer Science and Technology,2019,13(10):1621-1637.
[4] SHEPPERD M,SONG Q,SUN Z,et al.Data quality:some comments on the nasa software defect data sets[J].IEEE Trans.on Software Engineering,2013,39(9):1208-1215.
[5] GRAY D,BOWES D,DAVEY N,et al.Reflections on theNASA MDP data sets[J].IET S0ftware,2012,6(6):549-558.
[6] JURECZKO M,MADEYSKI L.Towards identifying softwareproject clusters with regard to defect prediction[C]//Proceedings of the 6th International Conference on Predictive Models in Software Engineering.Timisoara,2010,New York:ACM,2010:9.
[7] KHOSHGOFTAAR T M,GAO K,NAPOLITANO A,et al.A comparative study of iterative and non-iterative feature selection techniques for software defect prediction[J].Information Systems Frontiers,2013,16(5):1-22.
[8] WU R,ZHANG H,KIM S,et al.ReLink:recovering links between bugs and changes[C]//Proceedings of the Joint Meeting of the 19th ACM SIGSOFT Symposium and the13th European Conference on Foundations of Software Engineering,Szeged,2011.New York:ACM,2011:15-25.
[9] JING X,WU F,DONG X,et al.Heterogeneous cross-company defect prediction by unified metric representation and CCAbased transfer learning[C]//Proceedings of the Joint Meeting of the European Software Engineering Conference and the International Symposium on the Foundations of Software Engineering,Bergamo,2015.New York:ACM,2015:496-507.
[10] RADJENOVIC D,HERICKO M,TORKAR R,et al.Software fault prediction metrics:A systematic literature review[J].Information and Software Technology,2013,55(8):1397-1418.
[11] PUNITHA K,CHITRA S.Software defect prediction usingsoftware metrics-A survey[C]//International conference on Information Communication and Embedded Systems.Chennai,2013:555-558.
[12] AKIYAMA F.An example of software system debugging[C]//Proc.of the Int'l Federation of Information Proc.Societies Congress.New York:Springer Science and Business Media,1971:353-359.
[13] HALSTEAD M H.Elements of Software Science (Operatingand Programming Systems Series)[J].New York:Elsevier Science Inc.,1977.
[14] MCCABE T J.A complexity measure[J].IEEE Trans.on Software Engineering,1976,2(4):308-320.
[15] CHIDAMBER S R,KEMERER C F.A metrics suite for object oriented design[J].IEEE Trans.on Software Engineering,1994,20(6):476-493.
[16] SARKAR S,KAK A C,RAMA G M.Metrics for measuring the quality of modularization of large-scale object-oriented software[J].IEEE Trans.on Software Engineering,2008,34(5):700-720.
[17] FENTON N,BIEMAN J.Software metrics:a rigorous and practical approach(3rd ed)[M].Bosa Roca:CRC Press,2014:3-133.
[18] GJB5236-2004.Militray software quality metrics [S].Institute of China Aerospace Standardization,2004.
[19] GB/T 16260.1-2006.Software engineering-Product quality-Part 1:Quality model[S].Standardization Administration,2006.
[20] KAMEI Y,SHIHAB E,ADAMS B,et al.A large-scale empirical study of just-in-time quality assurance[J].IEEE Trans.on Software Engineering,2013,39(6):757-773.
[21] CAI L,FAN Y R,YAN M,et al.Just-in-time software defect prediction:literature review[J].Journal of Software,2019,30(5):1288-1307.
[1] DAI Yu, XU Lin-feng. Cross-image Text Reading Method Based on Text Line Matching [J]. Computer Science, 2022, 49(9): 139-145.
[2] YAN Min, LUO Xiao-qing, ZHANG Zhan-cheng. Infrared and Visible Image Fusion Network Based on Optical Transmission Model Learning [J]. Computer Science, 2022, 49(4): 215-220.
[3] ZHOU Hai-yu, ZHANG Dao-qiang. Multi-site Hyper-graph Convolutional Neural Networks and Application [J]. Computer Science, 2022, 49(3): 129-133.
[4] JIANG Hao-chen, WEI Zi-qi, LIU Lin, CHEN Jun. Imbalanced Data Classification:A Survey and Experiments in Medical Domain [J]. Computer Science, 2022, 49(1): 80-88.
[5] ZHANG Man, LI Jie, ZHU Xin-zhong, SHEN Ji, CHENG Hao-tian. Augmentation Technology of Remote Sensing Dataset Based on Improved DCGAN Algorithm [J]. Computer Science, 2021, 48(6A): 80-84.
[6] TENG Jun-yuan, GAO Meng, ZHENG Xiao-meng, JIANG Yun-song. Noise Tolerable Feature Selection Method for Software Defect Prediction [J]. Computer Science, 2021, 48(12): 131-139.
[7] ZHOU Yan, CHEN Shao-chang, WU Ke, NING Ming-qiang, CHEN Hong-kun, ZHANG Peng. SCTD 1.0:Sonar Common Target Detection Dataset [J]. Computer Science, 2021, 48(11A): 334-339.
[8] WANG Xiao-xiao, WANG Ting-wen, MA Yu-ling, FAN Jia-yi, CUI Chao-ran. Credit Risk Assessment Method of P2P Online Loan Borrowers Based on Deep Forest [J]. Computer Science, 2021, 48(11A): 429-434.
[9] XIE Yuan, MIAO Yu-bin, XU Feng-lin, ZHANG Ming. Injection-molded Bottle Defect Detection Using Semi-supervised Deep Convolutional Generative Adversarial Network [J]. Computer Science, 2020, 47(7): 92-96.
[10] YANG Lian-ping, SUN Yu-bo, ZHANG Hong-liang, LI Feng, ZHANG Xiang-de. Human Keypoint Matching Network Based on Encoding and Decoding Residuals [J]. Computer Science, 2020, 47(6): 114-120.
[11] CAI Qiang, DENG Yi-biao, LI Hai-sheng, YU Le, MING Shao-feng. Survey on Human Action Recognition Based on Deep Learning [J]. Computer Science, 2020, 47(4): 85-93.
[12] SHEN Qi, CHEN Yi-lun, LIU Shu, LIU Li-gang. 3D Object Detection Algorithm Based on Two-stage Network [J]. Computer Science, 2020, 47(10): 145-150.
[13] LI Zhuo, XU Zhe, CHEN Xin, LI Shu-qin. Location-related Online Multi-task Assignment Algorithm for Mobile Crowd Sensing [J]. Computer Science, 2019, 46(6): 102-106.
[14] ZHANG Fang, ZHAO Shu-liang, WU Yong-liang. Data Scaling Method for Multi-scale Data Mining [J]. Computer Science, 2019, 46(4): 57-65.
[15] WANG Yang, CAI Shu-qin, ZOU Xin-wen, CHEN Zi-tong. Quality-embedded Hypergraph Model for Big Data Product Manufacturing System and Decision for Production Lines [J]. Computer Science, 2019, 46(2): 11-17.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!