计算机科学 ›› 2015, Vol. 42 ›› Issue (2): 7-13.doi: 10.11896/j.issn.1002-137X.2015.02.002

• 目次 • 上一篇    下一篇

基于机器学习的microRNA预测方法研究进展

王颖,李金,王磊,徐成振,才忠喜   

  1. 哈尔滨工程大学自动化学院 哈尔滨150001;齐齐哈尔大学网络信息中心 齐齐哈尔161006,哈尔滨工程大学自动化学院 哈尔滨150001,哈尔滨工程大学自动化学院 哈尔滨150001,哈尔滨工程大学自动化学院 哈尔滨150001,哈尔滨工程大学自动化学院 哈尔滨150001
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受国家重大仪器专项(2012YQ0401401001),黑龙江省教育厅科学技术研究项目(12541898)资助

Research and Progress of microRNA Prediction Methods Based on Machine Learning

WANG Ying, LI Jin, WANG Lei, XU Cheng-zhen and CAI Zhong-xi   

  • Online:2018-11-14 Published:2018-11-14

摘要: 传统的克隆方法受组织和环境影响显著,且实验成本高,而计算方法中的比较方法对进化距离远的microRNA敏感性低,无法预测无同源的microRNA,机器学习方法解决了比较方法 依赖同源基因的问题。首先总结了基于机器学习预测microRNA的相关生物学知识;其次,给出基于机器学习的microRNA预测方法的大体流程,列举了基于机器学习的microRNA预测方法的最新研究算法及软件;再次,从数据集选取、特征集选取、分类器设计、特征子集选择、类不平衡问题解决和评价标准等环节出发,归纳总结了各环节中采用的方法及技术,并详细阐述了它们的最新研究进展,部分环节对采用的方法及技术进行了对比分析,总结了各自的优势和不足;最后,总结和展望了基于机器学习的microRNA预测方法的研究工作。

关键词: microRNA,机器学习,分类器,特征选取,类不平衡,生物信息学

Abstract: Traditional cloning experimental approaches are affected by the organizational and environmental impact,and the cost is high.The comparative method that belongs to the computational method is low sensitivity to the far evolutionary distance genes,and can’t predict the no homologous microRNAs.The machine learning method can resolve the restraints that comparative method dependents on the homologous gene.Firstly,this paper summarized the microRNA relevant biological knowledge which the machine learning is related to.Secondly,it outlined the general process and the latest research software and algorithms of machine learning based on microRNA prediction.Thirdly,starting from data selection,feature selection,classifier design,feature subset selection,class imbalance problem,performance evaluation and other aspects in terms of the essential elements of microRNA prediction based on the machine learning,it summarized the method and technology in each process,described their latest research progress.The approaches were contrasted and analyzed respectively in some process,and their respective advantages,disadvantages were summarized.Finally,summary and prospect of the research work on microRNA prediction based on machine learning were given.

Key words: microRNA,Machine learning,Classifier,Feature extraction,Class imbalance,Bioinformatics

[1] Baltimore D,Boldin M P,Connell M O,et al.MicroRNAs:new regulators of immune cell development and function[J].Nature,2008,9(8):839-845
[2] Song L,Tuan R S.MicroRNAs and cell differentiation in mammalian development[J].Birth Defects Research Part C:Embryo Today:Reviews,2006,78(2):140-149
[3] Bartel D P.MicroRNA:genomics,biogenesis,mechanism,andfunction[J].Cell,2004,6(2):281-297
[4] Hammond S M.MicroRNAs as oncogenes[J].Current opinionin genetics & development,2006,16:4-9
[5] Brennecke J,Hipfner D R,Stark A,et al.Cohen SM:bantam encodes a developmentally regulated microRNA that controls cell proliferation and regulates the proapoptotic gene hid in Drosophi-la[J].Cell,2003,113(1):25-36
[6] Rogaev E I.Small RNAs in human brain development and disor-ders[J].Biochemisry,2005,70 (12):1404
[7] Kozomara A,Griffiths-Jones S.miRBase:integrating microRNA annotation and deep-sequencing data[J].Nucleic Acids Res.,2009,39:D152-D157
[8] Büssing I,Slack F J,Grohans H,et al.let- microRNAs in development,stem cells and cancer[M].Trends Mol Med,2008
[9] Nielsen C B,Shomron N.Determinants of targeting by endogenous and exogenous microRNAs and siRNAs[J].RNA,2007,3:1894-1910
[10] Hertel J,Lindemeyer M,Missal K,et al.The expansion of themetazoan microRNA repertoire[J].BMC Genomics,2006:7-25
[11] Lee Y,Jeon K,Lee J T,et al.MicroRNA maturation:Stepwise processing and subcellular localization[J].The EMBO Journal,2002,21(17):4663-4670
[12] Eugene B.Evolution of microRNA diversity and regulation inanimals[J].Nature,2011:846-860
[13] Lee Y,Ahn C.The nuclear RNase III Drosha initiates microRNA processing[J].Nature,2011,425:415-419
[14] Elsebet L,Stephan G.Nuclear Export of MicroRNA Precur-sors[J].Science,2004,303:95-98
[15] Hutvágner G,Zamore P D.A microRNA in a Multiple-Turnover RNAi Enzyme Complex[J].Science,2002,7:2056-2060
[16] Reinhart B J,Bartel D P.Small RNAs Correspond to Centromere Heterochromatic Repeats[J].Science,2012,297(5588):1831
[17] Gregory R I,Chendrimada T P,Cooch N,et al.Human RISC couple microRNA biogenesis and posttranscriptional gene silencing[J].cell,2005,3:631-640
[18] Filipowicz W,Bhattacharyya S N,Sonenberg N.Mechanisms of post-transcriptional regulation by microRNAs:are the answers in sight? [J].Nature Reviews Genetics,2008,9:102-114
[19] Han J,Lee Y,Yeom K H,et al.Molecular basis for the recognition of primary microRNAs by the Drosha-DGCR8 complex[J].Cell,2009,125:887-901
[20] Helvik S A,Snove O,Strom P,et al.Reliable prediction ofDrosha processing sites improves microRNA gene prediction[J].Bioinformatics,2003,23:142-149
[21] He C,Li Y-X,Zhang G X,et al.MiRmat:Mature microRNA Sequence Prediction[J].PLoS ONE,2012,7(12):e51673
[22] Terai G,Okida H,Asai K,et al.Prediction of Conserved Precursors of microRNAs and Their Mature Forms by Integrating Position-Specific Structural Features[J].PLoS ONE,2012,7(9):e44314
[23] Xuan P,Guo M,Huang Y C,et al.MaturePred:Efficient Identification of MicroRNAs within Novel Plant Pre-microRNAs[J].PLoS ONE,2011,6(11):e27422
[24] Gkirtzou K,Tsamardinos I,Tsakalides P,et al.MatureBayes:A Probabilistic Algorithm for Identifying the Mature microRNA within Novel Precursors[J].PLoS ONE,2010,5(8):e11843
[25] Leclercq M,Diallo A B,Blanchette M.Computational prediction of the localization of microRNAs within their pre-microRNA[J].Nucleic Acids Research,2013,41(15):1-12
[26] Xue C,Li F,He T,et al.Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine[J].BMC Bioinformatics,2005(6):310
[27] Jiang P,Wu H,Wang W,et al.MiPred:classification of real and pseudo microRNA precursors using random forest prediction model with combined features[J].Nucleic Acids Research,2007,35:339-344
[28] Ng K L S,Mishra S K.De novo SVM classification of precursormicroRNAs from genomic pseudo hairpins using global and intrinsic folding measures[J].Bioinformatics,2007,23:1321-1330
[29] Batuwita R,Palade V.microPred:Effective classification of pre-microRNAs for human microRNA gene prediction[J].Bioinformatics,2009,25(8):989-995
[30] Liu X,He S,Skogerb G,et al.Integrated Sequence-StructureMotifs Suffice to Identify microRNA Precursors[J].PLoS ONE,2012,7(3):e32797
[31] Xuan P,Guo M Z,Liu X Y,et al.PlantMiRNAPred:efficient classification of real and pseudo plant pre-microRNAs[J].Bioinformatics,2011,27 (10):1368-1376
[32] Lertampaiporn S,Thammarongtham C,Nukoolkit C,et al.Heterogeneous ensemble approach with discriminative features and modified-SMOTE bagging for pre-microRNA classification[J].Nucleic Acids Research,2013,41:1-21
[33] Wu Y G,Wei B,Liu H Z,et al.MiRPara:a SVM-based software tool for prediction of most probable microRNA coding regions in genome scale sequences[J].BMC Bioinformatics,2011(12):107
[34] Guan D G,Liao J Y,Qu Z H,et al.mirExplorer Detecting microRNAs from genome and next generation sequencing data using the Adaboost method with transition probability matrix and combined features[J].RNA Biology,2011,8(5):922-934
[35] Agarwal S,Vaz C,Bhattacharya A,et al.Prediction of novel precursor microRNAs using a context-sensitive hidden Markov model (CSHMM)[J].BMC Bioinformatics,2010,11(Suppl 1):S29
No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!