Computer Science (计算机科学), 2014, Vol. 41, Issue (2): 261-263.

• Artificial Intelligence •

Simulated Sequencing Algorithm for the Second-Generation 454 Sequencer

陈伟,程咏梅,张绍武,潘泉   

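  1. School of Automation, Northwestern Polytechnical University, Xi'an 710072
  • Publication date: 2018-11-14 Release date: 2018-11-14
  • Funding:
    This work was supported by the Key Program of the National Natural Science Foundation (61135001), the National Natural Science Foundation of China (61170134, 60775012), the Aeronautical Science Foundation (20100853010), and the Doctoral Dissertation Innovation Fund of Northwestern Polytechnical University (cx201017).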

Simulation Algorithm for 454 Pyrosequencing Sequencers

CHEN Wei, CHENG Yong-mei, ZHANG Shao-wu and PAN Quan

  • Online: 2018-11-14 Published: 2018-11-14

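Abstract (translated from the Chinese): With the development of environmental genomics and deep sequencing technologies, research on microbial community structure based on 16S rRNA gene sequences has made considerable progress. However, because environmental samples are complex and, in particular, lack ground-truth information, quantitative study of environmental microbial community structure remains difficult. A simulation platform for sequencing not only supports quantitative and qualitative analysis of microbial community composition and structure, but also helps build benchmark datasets for evaluating current algorithms for microbial data analysis. Using an error-prone PCR error model and a normal distribution process, respectively, the emulsion PCR step and the sequencing-by-synthesis step of the 454 sequencer are simulated, and a simulated sequencing algorithm for the 454 sequencer (Tsim) is proposed. Simulation results show that the algorithm simulates the 454 sequencing process well.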

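Keywords (translated from the Chinese): 16S rRNA gene, simulated sequencing algorithm, PCR error model, normal distribution process, 454 sequencer. CLC number: TP311.52. Document code: A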

Abstract: Recent advances in environmental genomics and deep sequencing technologies have expanded our understanding of the composition and structure of microbial communities based on 16S rRNA gene sequences. However, the complexity of environmental samples, the difficulty of separating them, and the lack of ground truth make it hard to analyze microbes quantitatively. Simulated datasets are therefore useful for developing new software: they help explore microbial community structure quantitatively, and they allow benchmark studies that evaluate existing methods for processing 16S rRNA sequence data. In the present work, a simulation algorithm for the 454 sequencer (Tsim) was established, using an error-prone PCR model and a normal distribution model to simulate the sequencing-by-synthesis process. The simulation results show that the simulator can effectively simulate the 454 sequencing process.
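The two-stage idea in the abstract (error-prone PCR followed by normally distributed sequencing-by-synthesis signals) can be illustrated with a minimal sketch. This is not the Tsim implementation: the abstract gives no parameters, so the function names, the `TACG` flow order, the single per-base substitution rate, and the linear noise model `sigma_base + sigma_slope * n` are all assumptions made for illustration.

```python
import random

FLOW_ORDER = "TACG"  # assumed nucleotide flow order (not stated in the abstract)


def error_prone_pcr(template, cycles, error_rate, rng):
    """Follow one lineage of copies through `cycles` PCR rounds.

    In each round every base is substituted with probability `error_rate`.
    This is a simplified, assumed error model (substitutions only).
    """
    seq = list(template)
    for _ in range(cycles):
        for i, base in enumerate(seq):
            if rng.random() < error_rate:
                seq[i] = rng.choice([b for b in "ACGT" if b != base])
    return "".join(seq)


def homopolymer_flows(seq, flow_order=FLOW_ORDER):
    """Convert a sequence into ideal homopolymer lengths, one per flow."""
    if any(base not in flow_order for base in seq):
        raise ValueError("sequence contains a base outside the flow order")
    flows, pos, k = [], 0, 0
    while pos < len(seq):
        base = flow_order[k % len(flow_order)]
        run = 0
        while pos < len(seq) and seq[pos] == base:
            run += 1
            pos += 1
        flows.append(run)
        k += 1
    return flows


def noisy_flowgram(flows, sigma_base, sigma_slope, rng):
    """Draw each signal from Normal(n, sigma_base + sigma_slope * n), floored at 0."""
    return [max(0.0, rng.gauss(n, sigma_base + sigma_slope * n)) for n in flows]


def base_call(signals, flow_order=FLOW_ORDER):
    """Round each signal to an integer and emit that many bases for the flow."""
    return "".join(
        flow_order[k % len(flow_order)] * int(round(s))
        for k, s in enumerate(signals)
    )


def simulate_read(template, cycles, error_rate, sigma_base, sigma_slope, rng):
    """PCR copy -> ideal flows -> noisy flowgram -> called read."""
    amplified = error_prone_pcr(template, cycles, error_rate, rng)
    signals = noisy_flowgram(homopolymer_flows(amplified), sigma_base, sigma_slope, rng)
    return base_call(signals)


if __name__ == "__main__":
    rng = random.Random(42)  # fixed seed so runs are repeatable
    print(simulate_read("TTACGGATCCA", 20, 1e-4, 0.05, 0.06, rng))
```

Because the noise grows with homopolymer length in this sketch, long runs are the most likely to be miscalled, which is the kind of insertion/deletion behaviour a flowgram-based simulator is meant to reproduce. With the error rate and both sigma values set to zero, the read reproduces the template exactly.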

Key words: 16S rRNA gene, simulation algorithm, PCR model, normal distribution process, 454 sequencer

