Computer Science ›› 2022, Vol. 49 ›› Issue (2): 285-291.doi: 10.11896/jsjkx.201100195

• Artificial Intelligence • Previous Articles     Next Articles

Ensemble Learning Method for Nucleosome Localization Prediction

CHEN Wei, LI Hang, LI Wei-hua   

  1. School of Information Science and Engineering,Yunnan University,Kunming 650500,China
  • Received:2020-11-26 Revised:2021-04-19 Online:2022-02-15 Published:2022-02-23
  • About author:CHEN Wei,born in 1997,postgraduate.His main research interests include deep learning and bioinformatics.
    LI Wei-hua,corresponding author,born in 1977,Ph.D,associate professor.Her main research interests include data mining and bioinformatics.
  • Supported by:
    Scientific Research Fundation of the Education Department of Yunnan Province China(2019J0006) and Innovative Research Team of Yunnan Province,China(2018HC019).

Abstract: Nucleosome localization refers to the position of DNA double helix relative to histone,and plays an important regulatory role in DNA transcription.It takes a lot of time and resources to detect nucleosome localization by biological experiments.Therefore,it is an important research direction to predict nucleosome localization by using DNA sequences based on computationalmethods.Aiming at the shortcomings of single model and single code in DNA sequence feature representation and learning in nucleosome location prediction,this paper proposes an end-to-end ensemble deep learning model FuseENup,which uses three coding methods to represent DNA data from multiple dimensions.Different models extract the key features hidden in the data from different dimensions,and construct a new DNA sequence representation model.Performing 20-fold cross-validation on the four data sets,compared to the current model CORENup with the best comprehensive performance for the nucleosome localization prediction problem,the accuracy and precision of FuseENup are improved by 3% and 9% on the HS data set,increases 2% and 6% on the DM data set,1% and 4% on the E data set.Compared with other machine learning and deep learning benchmark models,FuseENup has better performance.Experiments show that FuseENup can improve the prediction accuracy of nucleosomes localization,which shows the effectiveness and scientificity of the method.

Key words: Cross-validation, Deep learning, DNA sequence coding, Ensemble learning method, Nucleosome localization

CLC Number: 

  • TP183
[1]RIDGWAY P,ALMOUZNI G.Chromatin assembly and organization[J].Journal of Cell Science,2001,114(15):2711-2712.
[2]MASKELL D P,RENAULT L,SERRAO E,et al.Structuralbasis for retroviral integration into nucleosomes[J].Nature,2015,523(7560):366-369.
[3]TABERLAY P C,STATHAM A L,KELLY T K,et al.Reconfiguration of nucleosome-depleted regions at distal regulatory elementsaccompanies DNA methylation of enhancers and insulators in cancer[J].Genome Research,2014,24(9):1421-1432.
[4]COLE H A,CUI F,OCAMPO J,et al.Novel nucleosomal particles containing core histones and linker DNA but no histone H1[J].Nucleic Acids Research,2016,44(2):573-581.
[5]SHAHBAZIAN M D,GRUNSTEIN M.Functions of Site-Specific Histone Acetylation and Deacetylation[J].Annual Review of Biochemistry,2007,76:75-100.
[6]SCHNITZLER G R.Control of nucleosome positions by DNAsequence and remodeling machines[J].Cell Biochemistry and Biophysics,2008,51(2/3):67-80.
[7]ZHENG D S,TRYNDA J,SUN Z F,et al.NUCLIZE for quantifying epigenome:generating histone modification data at single-nucleosome resolution using genuine nucleosome positions[J].Bmc Genomics,2019,20(1):541-544.
[8]BUITRAGO D,CODO L,ILLA R,et al.Nucleosome Dynamics:a new tool for the dynamic analysis of nucleosome positioning[J].Nucleic Acids Research,2019,47(18):9511-9523.
[9]SATCHWELL S C,DREW H R,TRAVERS A A.Sequence Periodicities in Chicken Nucleosome Core DNA[J].Journal of Molecular Biology,1986,191(4):659-675.
[10]DREW H R,TRAVERS A A.DNA Bending and Its Relation to Nucleosome Positioning[J].Journal of Molecular Biology,1985,186(4):773-790.
[11]LOWMAN H,BINA M.Correlation between Dinucleotide Pe-riodicities and Nucleosome Positioning on Mouse Satellite DNA[J].Biopolymers,1990,30(9/10):861-876.
[12]ZIVKOVIĆ V,STANKOVIĆ A,TATJANA C,et al.Anti-dsDNA,Anti-Nucleosome and Anti-C1q Antibodies as Disease Activity Markers in Patients with Systemic Lupus Erythematosus[J].Srpski Arhiv za Celokupno Lekarstvo,2014,142:431-436.
[13]YANG J F,XU Z Z,SUI M S,et al.Co-Positivity for Anti-dsDNA,-Nucleosome and-Histone Antibodies in Lupus Nephritis is Indicative of High Serum Levels and Severe Nephropathy[J].Plos One,2015,10(10):0140441.
[14]OLIVEIRA R C,OLIVEIRA I S,SANTIAGO M B,et al.High Avidity dsDNA Autoantibodies in Brazilian Women with Systemic Lupus Erythematosus:Correlation with Active Disease and Renal Dysfunction[J].Journal of Immunology Research,2015,2015:814748.
[15]CHENG J,CHEN H,MEN J L.Correlation between anti-nucleosome antibodies and systemic lupus erythematosus[J].Anhui Medicine,2019,23(1):83-86.
[16]ZHANG D F,MA Q Y,YIN T M.Third-generation sequencing technology and its application[J].Chinese Journal of Bioengineering,2013,33(5):125-131.
[17]XING Y Q,LIU G Q,ZHAO X J,et al.An analysis and prediction of nucleosome positioning based on information content[J].Chromosome Research,2013,21(1):63-74.
[18]STRUHL K,SEGAL E.Determinants of nucleosome positio-ning[J].Nature Structural & Molecular Biology,2013,20(3):267-273.
[19]LIELEG C,KRIETENSTEIN N,WALKER M,et al.Nucleo-some positioning in yeasts:methods,maps,and mechanisms[J].Chromosoma,2015,124(2):131-151.
[20]MEHER P K,SAHU T K,RAO A R.Identification of species based on DNA barcode using k-mer feature vector and Random forest classifier[J].Gene,2016,592(2):316-324.
[21]CHEN W,FENG P M,DING H,et al.Using deformation energy to analyze nucleosome positioning in genomes[J].Genomics,2016,107(2/3):69-75.
[22]BOSCO G L,RIZZO R,FIANNACA A,et al.A Deep Learning Model for Epigenomic Studies[C]//International Conference on Signal-image Technology & Internet-based Systems.2017.
[23]GANGI M A D,GAGLIO S,BUA C L,et al.A Deep Learning Network for Exploiting Positional Information in Nucleosome Related Sequences[J].Bioinformatics and Biomedical Enginee-ring,2017,10209(4):524-533.
[24]DI G M,LO B G,RIZZO R.Deep learning architectures for prediction of nucleosome positioning from sequences data[J].BMC Bioinformatics,2018,19(Suppl 14):418.
[25]ZHANG J,PENG W,WANG L.LeNup:learning nucleosome positioning from DNA sequences with improved convolutional neural networks[J].Bioinformatics,2018,34(10):1705-1712.
[1] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[2] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[3] TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[4] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[5] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[6] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[7] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[8] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[9] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[10] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
[11] ZHOU Hui, SHI Hao-chen, TU Yao-feng, HUANG Sheng-jun. Robust Deep Neural Network Learning Based on Active Sampling [J]. Computer Science, 2022, 49(7): 164-169.
[12] SU Dan-ning, CAO Gui-tao, WANG Yan-nan, WANG Hong, REN He. Survey of Deep Learning for Radar Emitter Identification Based on Small Sample [J]. Computer Science, 2022, 49(7): 226-235.
[13] WANG Jun-feng, LIU Fan, YANG Sai, LYU Tan-yue, CHEN Zhi-yu, XU Feng. Dam Crack Detection Based on Multi-source Transfer Learning [J]. Computer Science, 2022, 49(6A): 319-324.
[14] CHU Yu-chun, GONG Hang, Wang Xue-fang, LIU Pei-shun. Study on Knowledge Distillation of Target Detection Algorithm Based on YOLOv4 [J]. Computer Science, 2022, 49(6A): 337-344.
[15] ZHU Wen-tao, LAN Xian-chao, LUO Huan-lin, YUE Bing, WANG Yang. Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN [J]. Computer Science, 2022, 49(6A): 378-383.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!