Cargando…
Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades
COVID-19 and its causative pathogen SARS-CoV-2 have rushed the world into a staggering pandemic in a few months, and a global fight against both has been intensifying. Here, we describe an analysis procedure where genome composition and its variables are related, through the genetic code to molecula...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7875716/ https://www.ncbi.nlm.nih.gov/pubmed/33581339 http://dx.doi.org/10.1016/j.gpb.2020.10.003 |
_version_ | 1783649818220953600 |
---|---|
author | Teng, Xufei Li, Qianpeng Li, Zhao Zhang, Yuansheng Niu, Guangyi Xiao, Jingfa Yu, Jun Zhang, Zhang Song, Shuhui |
author_facet | Teng, Xufei Li, Qianpeng Li, Zhao Zhang, Yuansheng Niu, Guangyi Xiao, Jingfa Yu, Jun Zhang, Zhang Song, Shuhui |
author_sort | Teng, Xufei |
collection | PubMed |
description | COVID-19 and its causative pathogen SARS-CoV-2 have rushed the world into a staggering pandemic in a few months, and a global fight against both has been intensifying. Here, we describe an analysis procedure where genome composition and its variables are related, through the genetic code to molecular mechanisms, based on understanding of RNA replication and its feedback loop from mutation to viral proteome sequence fraternity including effective sites on the replicase-transcriptase complex. Our analysis starts with primary sequence information, identity-based phylogeny based on 22,051 SARS-CoV-2 sequences, and evaluation of sequence variation patterns as mutation spectra and its 12 permutations among organized clades. All are tailored to two key mechanisms: strand-biased and function-associated mutations. Our findings are listed as follows: 1) The most dominant mutation is C-to-U permutation, whose abundant second-codon-position counts alter amino acid composition toward higher molecular weight and lower hydrophobicity, albeit assumed most slightly deleterious. 2) The second abundance group includes three negative-strand mutations (U-to-C, A-to-G, and G-to-A) and a positive-strand mutation (G-to-U) due to DNA repair mechanisms after cellular abasic events. 3) A clade-associated biased mutation trend is found attributable to elevated level of negative-sense strand synthesis. 4) Within-clade permutation variation is very informative for associating non-synonymous mutations and viral proteome changes. These findings demand a platform where emerging mutations are mapped onto mostly subtle but fast-adjusting viral proteomes and transcriptomes, to provide biological and clinical information after logical convergence for effective pharmaceutical and diagnostic applications. Such actions are in desperate need, especially in the middle of the War against COVID-19. |
format | Online Article Text |
id | pubmed-7875716 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-78757162021-02-11 Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades Teng, Xufei Li, Qianpeng Li, Zhao Zhang, Yuansheng Niu, Guangyi Xiao, Jingfa Yu, Jun Zhang, Zhang Song, Shuhui Genomics Proteomics Bioinformatics Original Research COVID-19 and its causative pathogen SARS-CoV-2 have rushed the world into a staggering pandemic in a few months, and a global fight against both has been intensifying. Here, we describe an analysis procedure where genome composition and its variables are related, through the genetic code to molecular mechanisms, based on understanding of RNA replication and its feedback loop from mutation to viral proteome sequence fraternity including effective sites on the replicase-transcriptase complex. Our analysis starts with primary sequence information, identity-based phylogeny based on 22,051 SARS-CoV-2 sequences, and evaluation of sequence variation patterns as mutation spectra and its 12 permutations among organized clades. All are tailored to two key mechanisms: strand-biased and function-associated mutations. Our findings are listed as follows: 1) The most dominant mutation is C-to-U permutation, whose abundant second-codon-position counts alter amino acid composition toward higher molecular weight and lower hydrophobicity, albeit assumed most slightly deleterious. 2) The second abundance group includes three negative-strand mutations (U-to-C, A-to-G, and G-to-A) and a positive-strand mutation (G-to-U) due to DNA repair mechanisms after cellular abasic events. 3) A clade-associated biased mutation trend is found attributable to elevated level of negative-sense strand synthesis. 4) Within-clade permutation variation is very informative for associating non-synonymous mutations and viral proteome changes. These findings demand a platform where emerging mutations are mapped onto mostly subtle but fast-adjusting viral proteomes and transcriptomes, to provide biological and clinical information after logical convergence for effective pharmaceutical and diagnostic applications. Such actions are in desperate need, especially in the middle of the War against COVID-19. Elsevier 2020-12 2021-02-11 /pmc/articles/PMC7875716/ /pubmed/33581339 http://dx.doi.org/10.1016/j.gpb.2020.10.003 Text en © 2021 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Original Research Teng, Xufei Li, Qianpeng Li, Zhao Zhang, Yuansheng Niu, Guangyi Xiao, Jingfa Yu, Jun Zhang, Zhang Song, Shuhui Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades |
title | Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades |
title_full | Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades |
title_fullStr | Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades |
title_full_unstemmed | Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades |
title_short | Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades |
title_sort | compositional variability and mutation spectra of monophyletic sars-cov-2 clades |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7875716/ https://www.ncbi.nlm.nih.gov/pubmed/33581339 http://dx.doi.org/10.1016/j.gpb.2020.10.003 |
work_keys_str_mv | AT tengxufei compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades AT liqianpeng compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades AT lizhao compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades AT zhangyuansheng compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades AT niuguangyi compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades AT xiaojingfa compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades AT yujun compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades AT zhangzhang compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades AT songshuhui compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades |