计算机科学 ›› 2017, Vol. 44 ›› Issue (12): 131-134.doi: 10.11896/j.issn.1002-137X.2017.12.026

• 软件与数据库技术 • 上一篇    下一篇

基于多核字典学习的软件缺陷预测

王铁建,吴飞,荆晓远   

  1. 武汉大学计算机学院软件工程国家重点实验室 武汉430072,南京邮电大学自动化学院 南京210023,武汉大学计算机学院软件工程国家重点实验室 武汉430072
  • 出版日期:2018-12-01 发布日期:2018-12-01

Multiple Kernel Dictionary Learning for Software Defect Prediction

WANG Tie-jian, WU Fei and JING Xiao-yuan   

  • Online:2018-12-01 Published:2018-12-01

摘要: 提出一种多核字典学习方法,用以对软件模块是否存在缺陷进行预测。用于软件缺陷预测的历史数据具有结构复杂、类不平衡的特点,用多个核函数构成的合成核将这些数据映射到一个高维特征空间,通过对多核字典基的选择,得到一个类别平衡的多核字典,用以对新的软件模块进行分类和预测,并判定其中是否存在缺陷。在NASA MDP数据集上的实验表明,与其他软件缺陷预测方法相比,多核字典学习方法能够针对软件缺陷历史数据结构复杂、类不平衡的特点,较好地解决软件缺陷预测问题。

关键词: 软件缺陷预测,多核学习,字典学习,类不平衡

Abstract: A multiple kernel dictionary learning approach for software defect prediction was proposed.Software historical defect data have complicated structure and marked characteristic of class-imbalance.Multiple kernel learning is an effective technique in the field of machine learning which can map the historical defect data to a higher-dimensional feature space and make them express better.We got a multiple kernel dictionary learning classifier which has the advantages of both multiple kernel learning and dictionary learning.The widely used datasets from NASA MDP datasets are employed as test data to evaluate the performance of all compared methods.Experimental results demonstrate the effectiveness of the proposed multiple kernel dictionary learning approach for the software defect prediction task.

Key words: Software detect prediction,Multiple kernel learning,Dictionary learning,Class-imbalance

[1] LYU M R.Software Reliability Engineering:A Roadmap[C]∥Future of Software Engineering.2007:153-170.
[2] LESSMANN S,BAESENS B,M UES C,et al.Benchmarkingclassification models for software defect prediction:A proposed framework and novel findings[J].IEEE Transaction Software Engineering,2008,34(4):485-496.
[3] NAM J,PANY S J,KIM S.Transfer Defect Learning[C]∥ International Confernece on Software Engineering.2013:382-391.
[4] MCCABE T J.A complexity measure[J].IEEE Transactions Software Engineering,1976,SE-2(4):308-320.
[5] HALSTEAD M H.Elements of Software Science (Operatingand programming systems series)[M].New York:Elsevier North-Holland,1977.
[6] KHOSHGOFTAAR M T,GAO K,SELIYA N.Attribute Selection and Imbalanced Data:Problems in Software Defect Prediction[C]∥IEEE International Conference on Tools with Artificial Intelligence.2010:137-144,.
[7] GAO K,KHOSHGOFTAAR T M, NAPOLITANO A.A Hybrid Approach to Coping with High Dimensionality and Class Imbalance for Software Defect Prediction[C]∥Machine Lear-ning and Applications.2012:281-288.
[8] MULLER K R,MIKA S,RATSCH G,et al.An introduction to kernel based learning algorithms[J].IEEE Transactions on Neural Networks,2001,12(2):181-201.
[9] RAKOTOMAMONJY A,BACH F,CANU S.More efficiency in multiple kernel learning[C]∥International Conference on Machine Learning.2007:775-782.
[10] GEHLER P V,NOWOZIN S.On Feature Combination for Multiclass Object Classification[C]∥IEEE 12th International Conference on Computer Vision.2009:221-228.
[11] KEMBHAVI A,SIDDIQUIE B,MIEZIANKO R.IncrementalMultiple Kernel Learning for Object Recognition[C]∥IEEE 12th International Conference on Computer Vision.2009:638-645.
[12] YANG M,ZHANG L,FENG X,et al.Fisher DiscriminationDictionary Learning for sparse representation[C]∥IEEE International Conference on Computer Vision.2011:543-550.
[13] JING X Y,YING S,ZHANG Z W,et al.Dictionary learningbased software defect prediction[C]∥International Conference on Software Engineering.2014:792-802.
[14] ELISH K,ELISH M.Predicting Defect-prone Software Modules Using Support Vector Machines[J].Journal Systems and Software,2008,81(5):649-660.
[15] TURHAN B,BENER A.Analysis of nave bayes’ assumptions on software fault data:An empirical study[J].Data Knowledge Engineering,2009,68(2):278-290.
[16] WANG J,SHEN B J,CHEN Y T.Compressed C4.5 Models for Software Defect Prediction[C]∥International Conference on Quality Software.2012:13-16.
[17] SUN Z B,SONG Q B,ZHU X Y.Using Coding Based Ensemble Learning to Improve Software Defect Prediction[J].IEEE Transactions on Systems Man and Cybernetics Part C,2012,42(6):1806-1817.
[18] MA G,LUO C,CHEN H.Kernel Based Asymmetric Learning for Software Defect Prediction[J].IEICE Transactions on Information and Systems,2012,E95-D(1):215-226.
[19] ZHENG J.Cost-sensitive boosting neural networks for software defect prediction[J].Expert Systems with Applications,2010,37(6):4537-4543.
[20] ROSASCO L,VERRI A,SANTORO M,et al.Iterative projection methods for structured sparsity regularization.http://dspace.mit.edu/bitstream/handle/1721.1/49428/MIT-CSAIL-TR-2009-050.pdf?sequence=1.
[21] YANG M,ZHANG L,YANG J,et al.Metaface learning for sparse representation based face recognition[C]∥International Conference on Image Processing.2010:1601-1604.
[22] ZHANG D,YANG M,FENG X,Sparse representation or collabortive representation:which helps face recognition?[C]∥IEEE International Conference on Computer Vision.2011:471-478.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!