DOI QR코드

DOI QR Code

Computational analysis of SARS-CoV-2, SARS-CoV, and MERS-CoV genome using MEGA

  • Sohpal, Vipan Kumar (Department of Chemical & Bio Engineering, Beant College of Engineering & Technology)
  • Received : 2020.07.20
  • Accepted : 2020.09.22
  • Published : 2020.09.30

Abstract

The novel coronavirus pandemic that has originated from China and spread throughout the world in three months. Genome of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) predecessor, severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV) play an important role in understanding the concept of genetic variation. In this paper, the genomic data accessed from National Center for Biotechnology Information (NCBI) through Molecular Evolutionary Genetic Analysis (MEGA) for statistical analysis. Firstly, the Bayesian information criterion (BIC) and Akaike information criterion (AICc) are used to evaluate the best substitution pattern. Secondly, the maximum likelihood method used to estimate of transition/transversions (R) through Kimura-2, Tamura-3, Hasegawa-Kishino-Yano, and Tamura-Nei nucleotide substitutions model. Thirdly and finally nucleotide frequencies computed based on genomic data of NCBI. The results indicate that general times reversible model has the lowest BIC and AICc score 347,394 and 347,287, respectively. The transition/transversions bias for nucleotide substitutions models varies from 0.56 to 0.59 in MEGA output. The average nitrogenous bases frequency of U, C, A, and G are 31.74, 19.48, 28.04, and 20.74, respectively in percentages. Overall the genomic data analysis of SARS-CoV-2, SARS-CoV, and MERS-CoV highlights the close genetic relationship.

Keywords

References

  1. Zumla A, Chan JF, Azhar EI, Hui DS, Yuen KY. Coronaviruses: drug discovery and therapeutic options. Nat Rev Drug Discov 2016;15:327-347. https://doi.org/10.1038/nrd.2015.37
  2. Paules CI, Marston HD, Fauci AS. Coronavirus infections: more than just the common cold. JAMA 2020;323:707-708. https://doi.org/10.1001/jama.2020.0757
  3. Coronavirus disease (COVID-19) pandemic. Geneva: World Health Organization, 2020. Accessed 2020 Apr 18. Available from: https://www.who.int/emergencies/diseases/novel-coronavirus-2019.
  4. Lv L, Li G, Chen J, Liang X, Li Y. Comparative genomic analysis revealed specific mutation pattern between human coronavirus SARS-CoV-2 and Bat-SARSr-CoV RaTG13. Preprint BioRxiv https://doi.org/10.1101/2020.02.27.969006 (2020).
  5. Rehman SU, Shafique L, Ihsan A, Liu Q. Evolutionary trajectory for the emergence of novel coronavirus SARS-CoV-2. Pathogens 2020;9:240. https://doi.org/10.3390/pathogens9030240
  6. Lau SK, Woo PC, Li KS, Huang Y, Tsoi HW, Wong BH, et al. Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats. Proc Natl Acad Sci U S A 2005;102:14040-14045. https://doi.org/10.1073/pnas.0506735102
  7. Yang Z. Estimating the pattern of nucleotide substitution. J Mol Evol 1994;39:105-111.
  8. Hillis DM, Moritz C, Mable BK. Molecular Systematics. 2nd ed. Sunderland: Sinauer Associates, 1996.
  9. Jukes TH, Cantor CR. Evolution of protein molecules. In: Mammalian Protein Metabolism, Vol. 3 (Munro HN, ed.). New York: Academic Press, 1969. pp. 21-132.
  10. Felsenstein J. Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol 1981;17:368-376. https://doi.org/10.1007/BF01734359
  11. Zharkikh A. Estimation of evolutionary distances between nucleotide sequences. J Mol Evol 1994;39:315-329. https://doi.org/10.1007/BF00160155
  12. Kimura M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol 1980;16:111-120. https://doi.org/10.1007/BF01731581
  13. Kimura M. Estimation of evolutionary distances between homologous nucleotide sequences. Proc Natl Acad Sci U S A 1981;78:454-458. https://doi.org/10.1073/pnas.78.1.454
  14. Hasegawa M, Kishino H, Yano T. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J Mol Evol 1985;22:160-174. https://doi.org/10.1007/BF02101694
  15. Aho K, Derryberry D, Peterson T. Model selection for ecologists: the worldviews of AIC and BIC. Ecology 2014;95:631-636. https://doi.org/10.1890/13-1452.1
  16. Bromham L. Substitution rate analysis and molecular evolution. In: Phylogenetics in the Genomic Era (Scornavacca C, Delsuc F, Galtier N, eds.). The Authors, 2020. pp. 4.4:1-5.5:21.
  17. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol 2018;35:1547-1549. https://doi.org/10.1093/molbev/msy096