Computer Science ›› 2022, Vol. 49 ›› Issue (11A): 210800189-7.doi: 10.11896/jsjkx.210800189

• Interdiscipline & Application • Previous Articles     Next Articles

Scalable Parallel Computing Method for Conditional Likelihood Probability of Nucleotide Molecular Phylogenetic Tree Based on GPU

HUANG Jia-wei, LI Xiao-peng, LING Cheng   

  1. School of Information Science and Technology,Beijing University of Chemical Technology,Beijing 100000,China
  • Online:2022-11-10 Published:2022-11-21
  • About author:HUANG Jia-wei,born in 1996,postgraduate.His main research interests include high-performance GPU computing in bioinformatics and so on.
    LING Cheng,born in 1987,Ph.D,associate professor.His main research inte-rests include high-performance GPU computing on computational biology and bioinformatics.
  • Supported by:
    National Natural Science Foundation of China(61602026).

Abstract: The efficient implementation of Bayesian and Metropolis Hastings algorithms makes Mrbayes a widely used tool for molecular sequence phylogenetic analysis.However,the increase of molecular sequences and evolutionary parameters leads to the rapid expansion of the sample space of candidate molecular trees,which makes the reconstruction of phylogenetic trees face great computational challenges.In order to reduce the calculation time of conditional likelihood probability of molecular tree in mrbayes phylogenetic analysis and improve the analysis efficiency,a number of parallel acceleration methods based on graphics processor(GPU) have emerged in recent years.In order to improve the scalability of parallel methods,an optimized likelihood probability multithreaded parallel computing method is proposed in this paper.As the calculation of molecular state likelihood probability in the variable evolution rate model between sites needs to correspond to different transition probability matrices,this method further decomposes the parallel calculation of likelihood probability of different sites using multithreading into the calculation of conditional likelihood probability under different transition probability matrices between multiple sites.This strategy optimizes the parallel overlap between threads and improves the parallel efficiency by increasing the number of threads without changing the calculation transmission ratio of a single thread.In addition,because each thread warp only calculates the likelihood probability under the same transition probability matrix,it avoids the synchronization overhead between different warps when using shared memory,and further improves the computing efficiency of the kernel.Calculation results of 4 groups of actual data and 30 groups of simulated data show that the computational performance of this method is 1.78 and 2.04 times higher than that of tgMC3(version 2.0) and nMC3(version 2.1.1) in the calculation acceleration of core likelihood function.

Key words: MrBayes, Likelihood computing, GPU, Parallel computing, CUDA

CLC Number: 

  • TP399
[1]ARROWSMITH C,BOUNTRA C,FISH P,et al.Epigeneticprotein families:a new frontier for drug discovery.Nature Reviews Drug Discovery,2012,11(5):384-400.
[2]RAJAPAKSA S,RASANJANA W,PERERA I,et al.GPU Accelerated Maximum Likelihood Analysis for Phylogenetic Infe-rence[P].Software and Computer Applications,2019.
[3]GUINDON S,GASCUEL O.A Simple,Fast,and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood[J].Systematic Biology,2003,52(5):696-704.
[4]YANG Z,RANNALA B.Bayesian phylogenetic inference using DNA sequences:a markov chain monte carlo method[J].Mole-cular Biology & Evolution,1997(7):717-724.
[5]HASTINGS W K.Monte Carlo Sampling Methods Using Mar-kov Chains and Their Applications[J].Biometrika,1970,57(1):97-107.
[6]GREEN P J.Reversible Jump Markov Chain Monte Carlo Computation andBayesian Model Determination[J].Biometrika,1995,82(4):711-732.
[7]METROPOLIS N,ROSENBLUTH A W,ROSENBLUTH MN,et al.Equation of State by Fast Computing Machines[J].Journal of Chemical Physics,1953,21(6):1087-1092.
[8]PRATAS F,TRANCOSO P,STAMATAKIS A,et al.Fine-grain parallelism using multi-core,Cell/BE,and GPU systems:Accelerating the phylogenetic likelihood function[C]//2009 International Conference on Parallel Processing.IEEE,2009:9-17.
[9]PANG S A,STONES R J,REN M M,et al.GPU MrBayes V3.1:MrBayes on Graphics Processing Units for Protein Sequence Data.[J].Molecular Biology and Evolution,2015,32(9):2496-2497.
[10]ZHAO M,REN Q,WANG Y,et al.A Three-Level Parallel Algorithm for MrBayes 3.2[C]//2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications(ISPA/IUCC).IEEE,2017:1246-1250.
[11]PRATAS F,TRANCOSO P,SOUSA L,et al.Fine-grain parallelism using multi-core,Cell/BE,and GPU Systems[J].Parallel Computing,2012,38(8):365-390.
[12]ZHOU J F,LIU X G,STONES D S,et al.MrBayes on a Gra-phics Processing Unit[J].Bioinformatics,2011,27(9):1-7.
[13]BAO J,XIA H J,ZHOU J F,et al.Efficient implementation of MrBayes on multi-GPU[J].Molecular Biology and Evolution,2013,30(6):1471-1479.
[14]LING C,HAMADA T,BAI J N,et al.MrBayes tgMC3:ATight GPU Implementation of MrBayes[J].PLOS ONE,2013,8(4):1-9.
[15]LING C,HAMADA T,GAO J Y,et al.MrBayes tgMC3++:A High Performance and Resource-Efficient GPU-Oriented Phylogenetic Analysis Method[J].IEEE/ACM Transactions on Computational Biology and Bioinformatics,2016,13(5):1-9.
[16]KUAN L,PRATAS F,SOUSA L,et al.MrBayes sMC3:Accelerating Bayesian inference of phylogenetic trees[J].The International Journal of High Performance Computing Applications,2018,32(2):754-755.
[17]AYRES D L,DARLING A,ZWICKL D J,et al.BEAGLE:An Application Programming Interface and High-Performance Computing Library for Statistical Phylogenetics[J].Systematic Bio-logy,2012,61(1):170-173.
[18]AYRES DANIEL L,CUMMINGS MICHAEL P,BAELEG,et al.BEAGLE 3:Improved Performance,Scaling,and Usability for a High-Performance Computing Library for Statistical Phylogenetics.[J].Systematic Biology,2019,68(6):1052-1061.
[19]YANG Z.Maximum likelihood phylogenetic estimation fromDNA sequences with variable rates over sites:approximate methods[J].Journal of molecular evolution,1994,39(3):306-314.
[20]ANDREW R,NICHOLAS C.Grass.Seq-Gen:an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees[J].Bioinformatics,1997,13(3):235-238.
[1] ZHAI Xulun, ZHANG Yongguang, JIN Anzhao, QIANG Wei, LI Mengbing. Parallel DVB-RCS2 Turbo Decoding on Multi-core CPU [J]. Computer Science, 2023, 50(6): 22-28.
[2] WANG Zhuang, WANG Pinghui, WANG Bincheng, WU Wenbo, WANG Bin, CONG Pengyu. GPU Shared Scheduling System Under Deep Learning Container Cloud Platform [J]. Computer Science, 2023, 50(6): 86-91.
[3] GUO Zheng-wei, FU Ze-wen, LI Ning, BAI Lan. Study on Acceleration Algorithm for Raw Data Simulation of High Resolution Squint Spotlight SAR [J]. Computer Science, 2022, 49(8): 178-183.
[4] ZONG Di-di, XIE Yi-wu. Model Medial Axis Generation Method Based on Normal Iteration [J]. Computer Science, 2022, 49(6A): 764-770.
[5] CHEN Xin, LI Fang, DING Hai-xin, SUN Wei-ze, LIU Xin, CHEN De-xun, YE Yue-jin, HE Xiang. Parallel Optimization Method of Unstructured-grid Computing in CFD for DomesticHeterogeneous Many-core Architecture [J]. Computer Science, 2022, 49(6): 99-107.
[6] WANG Jin, LIU Jiang. GPU-based Parallel DILU Preconditioning Technique [J]. Computer Science, 2022, 49(6): 108-118.
[7] ZHU Ruo-chen, YANG Chang-chun, ZHANG Deng-hui. EGOS-DST:Efficient Schema-guided Approach to One-step Dialogue State Tracking for Diverse Expressions [J]. Computer Science, 2022, 49(11A): 210900246-7.
[8] FU Tian-hao, TIAN Hong-yun, JIN Yu-yang, YANG Zhang, ZHAI Ji-dong, WU Lin-ping, XU Xiao-wen. Performance Skeleton Analysis Method Towards Component-based Parallel Applications [J]. Computer Science, 2021, 48(6): 1-9.
[9] HE Ya-ru, PANG Jian-min, XU Jin-long, ZHU Yu, TAO Xiao-han. Implementation and Optimization of Floyd Parallel Algorithm Based on Sunway Platform [J]. Computer Science, 2021, 48(6): 34-40.
[10] LI Fan, YAN Xing, ZHANG Xiao-yu. Optimization of GPU-based Eigenface Algorithm [J]. Computer Science, 2021, 48(4): 197-204.
[11] HU Rong, YANG Wang-dong, WANG Hao-tian, LUO Hui-zhang, LI Ken-li. Parallel WMD Algorithm Based on GPU Acceleration [J]. Computer Science, 2021, 48(12): 24-28.
[12] WEN Min-hua, WANG Shen-peng, WEI Jian-wen, LI Lin-ying, ZHANG Bin, LIN Xin-hua. DGX-2 Based Optimization of Application for Turbulent Combustion [J]. Computer Science, 2021, 48(12): 43-48.
[13] MA Meng-yu, WU Ye, CHEN Luo, WU Jiang-jiang, LI Jun, JING Ning. Display-oriented Data Visualization Technique for Large-scale Geographic Vector Data [J]. Computer Science, 2020, 47(9): 117-122.
[14] CHEN Guo-liang, ZHANG Yu-jie, . Development of Parallel Computing Subject [J]. Computer Science, 2020, 47(8): 1-4.
[15] YANG Wang-dong, WANG Hao-tian, ZHANG Yu-feng, LIN Sheng-le, CAI Qin-yun. Survey of Heterogeneous Hybrid Parallel Computing [J]. Computer Science, 2020, 47(8): 5-16.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!