Computer Science ›› 2022, Vol. 49 ›› Issue (2): 265-271.doi: 10.11896/jsjkx.201100132

• Artificial Intelligence • Previous Articles     Next Articles

Ensemble Regression Decision Trees-based lncRNA-disease Association Prediction

REN Shou-peng1, LI Jin1, WANG Jing-ru1, YUE Kun2   

  1. 1 School of Software,Yunnan University,Kunming 650091,China
    2 School of Information Science & Engineering,Yunnan University,Kunming 650091,China
  • Received:2020-11-18 Revised:2021-05-30 Online:2022-02-15 Published:2022-02-23
  • About author:REN Shou-peng,born in 1997,master.His main research interests include bioinformatics and machine learning.
    LI Jin,born in 1975,Ph.D,professor.His main research interests include machine learning and bioinformatics.
  • Supported by:
    Foundation of National Natural Science Foundation of China United Yunnan Province(U1802271),Foundation of Outstanding Youth Project of Basic Research in Yunnan Province(2019FJ011) and Foundation of Key Project of Basic Research in Yunnan Province(201901BB050052).

Abstract: Long non-coding RNA (lncRNA) plays an important role in various complex human diseases.The development of effective prediction methods to infer the potential associations between lncRNA and diseases will not only help biologists understand the pathogenesis of diseases,but also contribute to the diagnosis,prevention,and treatment of human diseases.In this paper,an ensemble regression decision tree-based lncRNA-disease association method (ERDTLDA) is proposed to solve the lncRNA-disease association problem.First,ERDTLDA uses the open-source data of lncRNA to construct lncRNA,disease similarity matrix,lncRNA-disease association matrix respectively.Then,we obtain lncRNA,disease feature representations from these matrices.Principal component analysis is further exploited for feature extraction.Finally,a CART regression decision tree is used to yield association scores.An ensemble strategy for multiple decision trees is proposed to further improve the accuracy of our model.The results of LOOCV experiments show that the AUC of our method on three real lncRNA-disease datasets are 0.905 5,0.896 9 and 0.912 9 respectively,which are 6.46%,5.4% and 6.02% higher than the existing methods,respectively.Additionally,breast cancer,lung cancer,and gastric cancer are also used as case studies to further verify the accuracy and effectiveness of ERDTLDA.

Key words: Association prediction, CART decision tree, Ensemble learning, Feature extraction, lncRNA-disease

CLC Number: 

  • TP391
[1]HUTTENHOFER A,SCHATTNER P,POLACEK N.Non-coding RNAs:hope or hype?[J].Trends in Genetics,2005,21(5):289-297.
[2]GEISLER S,COLLER J.RNA in unexpected places:Long non- coding RNA functions in diverse cellular contexts[J].Nature Reviews Molecular Cell Biology,2013,14(11):699-712.
[3]CHEN X,YAN C C,ZHANG X,et al.Long non-coding RNAsand complex diseases:From experimental results to computational models[J].Briefings in Bioinformatics,2016,18(4):558-576.
[4]SUN J,SHI H B,WANG Z Z,et al.Infering novel lncRNA-di-sease asociations based on a random walk model of a lncRNA functional similarity network[J].Molecular BioSystems,2014,10(8):2074-2081.
[5]GU C,LI X Y,CAI L J,et al.Global network random walk for predicting potential human lncRNA-disease associations[J].Sci. Rep.,2017,7(1):12442-12453.
[6]WEN Y,HAN G,ANH V.Laplacian normalization and bi-random walks on heterogeneous networks for predicting lncRNA-disease associations[J].BMC Systems Biology,2018,12(9),122-131.
[7]CHEN X,YAN G Y.Novel human lncRNA-disease associationinference based on lncRNA expression profiles[J].Bioinforma-tics,2013,29(20):2617-2624.
[8]CHEN X,YANG C G,LUO C,et al.Constructing lncRNAfunctional similarity network based on lncRNA-disease associations and disease semantic similarity[J].Scientific Reports,2015,5:11338-11350.
[9]ZHAO T T,XU J Y,LIU L,et al.Identification of cancer-rela-ted lncRNAs through integrating genome,regulome and transcriptome features[J].Molecular BioSystems,2014,11(1):126-136.
[10]XUAN P,PAN S,ZHANG T,et al.Graph Convolutional Network and Convolutional Neural Network Based Method for Predicting lncRNA-Disease Associations[J].Cells,2019,8(9),1012:1-16.
[11]WANG M N,YOU Z H,WANG L.LDGRNMF:LncRNA-di-sease associations prediction based on graph regularized non-ne-gative matrix factorization[J].Neurocomputing,2021,424:236-245.
[12]LIU J X,CUI Z,GAO Y L,et al.WGRCMF:A Weighted Graph Regularized Collaborative Matrix Factorization Method for Predicting Novel LncRNA-Disease Associations[J].IEEE Journal of Biomedical and Health Informatics,2021,25(1):257-265.
[13]WEI H,LIAO Q,LIU B.iLncRNAdis-FB:identify lncRNA-di-sease associations by fusing biological feature blocks through deep neural network[J].IEEE/ACM Transactions on Computational Biology and Bioinformatics/IEEE,ACM,2020(99):1-13.
[14]MA Y,GUO X L,SUN Y T,et al.Prediction of Disease Asso-ciated Long Non-Coding RNA Based on HeteSim[J].Journal of Computer Research and Development,2019,56(9):1889-1896.
[15]CHEN G,WANG Z Y,WANG D Q,et al.LncRNADisease:A database for long-non-coding RNA-asociated diseases[J].Nuc-leic Acids Research,2012,41(D1):D983-D986.
[16]NING S,ZHANG J,PENG W,et al.Lnc2Cancer:A manualycurated database of experimentaly supported lncRNAs asociated with various human cancers[J].Nucleic Acids Research,2015,44(D1):D980-D985.
[17]PENG H,LAN C W,LIU Y S,et al.Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes[J].Oncotarget,2017,8(45):78901-78916.
[18]BREIMAN L,FRIEDMAN J,STONE C J,et al.Classification and regression trees[M].CRC Press,1984:1-18.
[19]FRIENDENSON B.The BRCA1/2 pathway prevents hemato-logic cancers in addition to breast and ovarian cancers[J].BMC Cancer,2007,7(1):152-162.
[1] ZHANG Yuan, KANG Le, GONG Zhao-hui, ZHANG Zhi-hong. Related Transaction Behavior Detection in Futures Market Based on Bi-LSTM [J]. Computer Science, 2022, 49(7): 31-39.
[2] ZENG Zhi-xian, CAO Jian-jun, WENG Nian-feng, JIANG Guo-quan, XU Bin. Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism [J]. Computer Science, 2022, 49(7): 106-112.
[3] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[4] LIU Wei-ye, LU Hui-min, LI Yu-peng, MA Ning. Survey on Finger Vein Recognition Research [J]. Computer Science, 2022, 49(6A): 1-11.
[5] LIN Xi, CHEN Zi-zhuo, WANG Zhong-qing. Aspect-level Sentiment Classification Based on Imbalanced Data and Ensemble Learning [J]. Computer Science, 2022, 49(6A): 144-149.
[6] KANG Yan, WU Zhi-wei, KOU Yong-qi, ZHANG Lan, XIE Si-yu, LI Hao. Deep Integrated Learning Software Requirement Classification Fusing Bert and Graph Convolution [J]. Computer Science, 2022, 49(6A): 150-158.
[7] WANG Yu-fei, CHEN Wen. Tri-training Algorithm Based on DECORATE Ensemble Learning and Credibility Assessment [J]. Computer Science, 2022, 49(6): 127-133.
[8] HAN Hong-qi, RAN Ya-xin, ZHANG Yun-liang, GUI Jie, GAO Xiong, YI Meng-lin. Study on Cross-media Information Retrieval Based on Common Subspace Classification Learning [J]. Computer Science, 2022, 49(5): 33-42.
[9] GAO Yuan-hao, LUO Xiao-qing, ZHANG Zhan-cheng. Infrared and Visible Image Fusion Based on Feature Separation [J]. Computer Science, 2022, 49(5): 58-63.
[10] ZUO Jie-ge, LIU Xiao-ming, CAI Bing. Outdoor Image Weather Recognition Based on Image Blocks and Feature Fusion [J]. Computer Science, 2022, 49(3): 197-203.
[11] CHEN Wei, LI Hang, LI Wei-hua. Ensemble Learning Method for Nucleosome Localization Prediction [J]. Computer Science, 2022, 49(2): 285-291.
[12] LIU Zhen-yu, SONG Xiao-ying. Multivariate Regression Forest for Categorical Attribute Data [J]. Computer Science, 2022, 49(1): 108-114.
[13] ZHOU Xin-min, HU Yi-gui, LIU Wen-jie, SUN Rong-jun. Research on Urban Function Recognition Based on Multi-modal and Multi-level Data Fusion Method [J]. Computer Science, 2021, 48(9): 50-58.
[14] ZHANG Shi-peng, LI Yong-zhong. Intrusion Detection Method Based on Denoising Autoencoder and Three-way Decisions [J]. Computer Science, 2021, 48(9): 345-351.
[15] FENG Xia, HU Zhi-yi, LIU Cai-hua. Survey of Research Progress on Cross-modal Retrieval [J]. Computer Science, 2021, 48(8): 13-23.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!