Computer Science, 2020, Vol. 47, Issue (11A): 192-195. doi: 10.11896/jsjkx.191200070

• Computer Graphics & Multimedia •

Visualization of DNA Sequences of Two Kinds of Bacteria Under Firmicutes

DU Liu-yun, ZHENG Zhi-jie, ZHENG Hua-xian

  1. School of Software, Yunnan University, Kunming 650500, China
  • Online: 2020-11-15 Published: 2020-11-17
  • Corresponding author: ZHENG Zhi-jie (conjugatelogic@yahoo.com)
  • About author: DU Liu-yun, born in 1995, master. Her main research interests include computational biology.
    ZHENG Zhi-jie, born in 1956, professor. His main research interests include computational biology and quantum cryptography.
    Author e-mail: 617279723@qq.com
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (K1020720), the Overseas Senior Scholar Program of Yunnan Province (W811305), the Science and Technology Program of Yunnan Province (KC1810123), and the Key Project on Electronic Information and Next Generation IT Technology of Yunnan (2018ZI002).

Abstract: Whole-genome DNA sequencing is advancing rapidly as researchers explore relationships across biological levels, from complex bacterial communities down to individual species and genera, and the need to visualize scientific computing data for gene sequences is increasingly urgent. The complementary symmetry between the double-helix structure of DNA and its complex spatial structure is important for exploring large numbers of long DNA sequences. Starting from the core idea that “sequence determines structure, and structure determines function”, this paper applies a measurement model and method based on the variant system, combining information technology with statistics, to analyze and compare the complete DNA sequences of two kinds of bacteria, Bacillus and Mycobacterium. It presents the two-dimensional feature distributions of their complete DNA sequences and shows the similarities and differences between the two kinds of bacteria in visual form. Compared with traditional bacterial visualization methods, the method has low time complexity and good stability, and it is intuitive and easy to understand. A series of distribution maps is provided under different choices of measurement reference.

Key words: DNA, Gene sequence, Measurement model, Variant system, Visualization
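
The abstract does not spell out the measurement model, so the following is only a rough illustration of the general idea of turning a long sequence into a two-dimensional feature distribution. It is not the authors' variant-system method. The sketch splits a sequence into fixed-length segments, counts bases per segment, and tallies how many segments share each pair of counts. The segment length, the choice of A and G as the two axes, the toy sequence, and the function names `segment_counts` and `feature_histogram` are all assumptions made for illustration.

```python
from collections import Counter


def segment_counts(seq, seg_len):
    """Split seq into non-overlapping segments of seg_len bases
    (a shorter tail is dropped) and count A, C, G, T in each one."""
    seq = seq.upper()
    n_full = len(seq) // seg_len
    rows = []
    for i in range(n_full):
        seg = seq[i * seg_len:(i + 1) * seg_len]
        counts = Counter(seg)
        rows.append({base: counts.get(base, 0) for base in "ACGT"})
    return rows


def feature_histogram(rows, x_base="A", y_base="G"):
    """Tally how many segments fall on each (x_base count, y_base count) point.
    The axis choice is an illustrative assumption, not taken from the paper."""
    return Counter((row[x_base], row[y_base]) for row in rows)


# Toy input; a real run would read a complete genome sequence instead.
toy = "ACGTACGTAAGGCCTTAGAG"
rows = segment_counts(toy, seg_len=4)
hist = feature_histogram(rows)
```

Each key of `hist` is a point in the plane and each value is the number of segments that land on it, so the result can be drawn as a scatter plot or heat map. Two genomes processed with the same segment length and axes can then be compared by eye.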

CLC number:

  • TP399