Computer Science ›› 2018, Vol. 45 ›› Issue (8): 160-165.doi: 10.11896/j.issn.1002-137X.2018.08.029

• Software & Database Technology • Previous Articles     Next Articles

Software Defect Prediction Based on Improved Deep Forest Algorithm

XUE Can-guan, YAN Xue-feng   

  1. College of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,China
  • Received:2017-07-11 Online:2018-08-29 Published:2018-08-29

Abstract: Software defect prediction is an important way to rationally use software testing resources and improve software performance.In order to solve the problem that the shallow machine learning algorithm cannot deeply mine the characteristics of software data,an improved deep forest algorithm named deep stacking forest (DSF) was proposed.This algorithm firstly adopts the random sampling method to transform the original features to enhance its feature expression ability,and then uses the stacking structure to performlayer-by-layer representation learning for the transform features.The deep stacking forest was applied for the defect prediction of Eclipse dataset.The experimental results show that the algorithm has significant improvement in the predicting performance and time efficiency than the deep forest.

Key words: Deep forest, Deep stacking forest, Random sampling, Software defect prediction, Stacking structure

CLC Number: 

  • TP311
[1]WANG Q,WU S J,LI M S.Software defect prediction[J].Journal of Software,2008,19(7):1565-1580.(in Chinese)王青,伍书剑,李明树.软件缺陷预测技术[J].软件学报,2008,19(7):1565-1580.
[2]CHEN X,GU Q,LIU W S,et al.Survey of static software defect prediction[J].Journal ofSoftware,2016,27(1):1-25.(in Chinese)陈翔,顾庆,刘望舒,等.静态软件缺陷预测方法研究[J].软件学报,2016,27(1):1-25.
[3]JINDAL R,MALHOTRA R,JAIN A.Software defect prediction using neural networks[C]∥International Conference on Reliability,INFOCOM Technologies and Optimization.IEEE,2015:1-6.
[4]OKUTAN A,YILDIZ O T.Software defect prediction usingBayesian networks[J].Empirical Software Engineering,2014,19(1):154-181.
[5]KALAI M R,GRACIA JACOB S.Improved Random ForestAlgorithm for Software Defect Prediction through Data Mining Techniques[J].International Journal of Computer Applications,2015,117(23):18-22.
[6]WANG T,ZHANG Z,JING X,et al.Multiple kernel ensemble learning for software defect prediction[J].Automated Software Engineering,2016,23(4):1-22.
[7]THANGAVEL M,NASIRA G M.Support Vector Machine for Software Defect Prediction[J].International Journal of Applied Engineering Research,2014,9(24):25633-25644.
[8]SUN Z J,XUE L,XU Y M,et al.Overview of deep learning[J].Application Research of Computers,2012,29(8):2806-2810.(in Chinese)孙志军,薛磊,许阳明,等.深度学习研究综述[J].计算机应用研究,2012,29(8):2806-2810.
[9]HINTON G E,SALAKHUTDINOV R R.Reducing the dimensionality of data with neural networks[J].Science,2006,313(5786):504.
[10]DENGL,DONG Y.Deep Learning:Methods and Applications[M].Now Publishers Inc.,2014.
[11]ZHOU Z H,FENG J.Deep Forest:Towards An Alternative to Deep Neural Networks[C]∥IJCAI-17.2017:3553-3559.
[12]BREIMAN L.Random Forests[J].Machine Learning,2001,45(1):5-32.
[13]LIU F T,TING K M,YU Y,et al.Spectrum of variable-random trees[J].Journal of Artificial Intelligence Research,2008,32(1):355-384.
[14]WOLPERT D H.Stacked Generalization[J].Neural Networks,1992,5(2):241-259.
[15]ZIMMERMANN T,PREMRAJ R,ZELLER A.Predicting Defects for Eclipse[C]∥International Workshop on Predictor MODELS in Software Engineering.IEEE,2007:9.
[1] ZHENG Xiao-meng, GAO Meng, TENG Jun-yuan. Research on Construction Method of Defect Prediction Dataset for Spacecraft Software [J]. Computer Science, 2021, 48(6A): 575-580.
[2] TENG Jun-yuan, GAO Meng, ZHENG Xiao-meng, JIANG Yun-song. Noise Tolerable Feature Selection Method for Software Defect Prediction [J]. Computer Science, 2021, 48(12): 131-139.
[3] WANG Xiao-xiao, WANG Ting-wen, MA Yu-ling, FAN Jia-yi, CUI Chao-ran. Credit Risk Assessment Method of P2P Online Loan Borrowers Based on Deep Forest [J]. Computer Science, 2021, 48(11A): 429-434.
[4] ZHANG De-gan, FAN Hong-rui, GONG Chang-le, GAO Jin-xin, ZHANG Ting, ZHAO Peng-zhen and CHEN Chen. New Method of Data Missing Estimation for Vehicle Traffic Based on Tensor [J]. Computer Science, 2020, 47(6A): 505-511.
[5] JIANG Peng-fei, WEI Song-jie. Classification and Evaluation of Mobile Application Network Behavior Based on Deep Forest and CWGAN-GP [J]. Computer Science, 2020, 47(1): 287-292.
[6] SUN Yong-yue, LI Hong-yan, ZHANG Jin-bo. RAISE:Efficient Influence Cost Minimizing Algorithm in Social Network [J]. Computer Science, 2019, 46(9): 59-65.
[7] GE Shao-lin, YE Jian, HE Ming-xiang. Prediction Model of User Purchase Behavior Based on Deep Forest [J]. Computer Science, 2019, 46(9): 190-194.
[8] HAN Hui,WANG Li-ming,CHAI Yu-mei,LIU Zhen. Text Sentiment Classification Based on Deep Forests with Enhanced Features [J]. Computer Science, 2019, 46(7): 172-179.
[9] QIU Shao-jian, CAIZi-yi, LU Lu. Cost-sensitive Convolutional Neural Network Model for Software Defect Prediction [J]. Computer Science, 2019, 46(11): 156-160.
[10] HU Meng-yuan, HUANG Hong-yun, DING Zuo-hua. Ensemble Model for Software Defect Prediction [J]. Computer Science, 2019, 46(11): 176-180.
[11] CHEN Xiang, WANG Qiu-ping. Multi-objective Supervised Defect Prediction Modeling Method Based on Code Changes [J]. Computer Science, 2018, 45(6): 161-165.
[12] YANG Jie, YAN Xue-feng and ZHANG De-ping. Cost-sensitive Software Defect Prediction Method Based on Boosting [J]. Computer Science, 2017, 44(8): 176-180.
[13] GAN Lu, ZANG Lie and LI Hang. Deep Belief Network Software Defect Prediction Model [J]. Computer Science, 2017, 44(4): 229-233.
[14] XIONG Jing, GAO Yan and WANG Ya-yu. Software Defect Prediction Model Based on Adaboost Algorithm [J]. Computer Science, 2016, 43(7): 186-190.
[15] WU Yang,MA Yun-fei,WANG Tao and XING Meng. Independent Identification of Encrypted Data in Data Link Layer Based on Randomness Test [J]. Computer Science, 2014, 41(8): 192-196.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!