Cargando…

Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades

COVID-19 and its causative pathogen SARS-CoV-2 have rushed the world into a staggering pandemic in a few months, and a global fight against both has been intensifying. Here, we describe an analysis procedure where genome composition and its variables are related, through the genetic code to molecula...

Descripción completa

Detalles Bibliográficos
Autores principales: Teng, Xufei, Li, Qianpeng, Li, Zhao, Zhang, Yuansheng, Niu, Guangyi, Xiao, Jingfa, Yu, Jun, Zhang, Zhang, Song, Shuhui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7875716/
https://www.ncbi.nlm.nih.gov/pubmed/33581339
http://dx.doi.org/10.1016/j.gpb.2020.10.003
_version_ 1783649818220953600
author Teng, Xufei
Li, Qianpeng
Li, Zhao
Zhang, Yuansheng
Niu, Guangyi
Xiao, Jingfa
Yu, Jun
Zhang, Zhang
Song, Shuhui
author_facet Teng, Xufei
Li, Qianpeng
Li, Zhao
Zhang, Yuansheng
Niu, Guangyi
Xiao, Jingfa
Yu, Jun
Zhang, Zhang
Song, Shuhui
author_sort Teng, Xufei
collection PubMed
description COVID-19 and its causative pathogen SARS-CoV-2 have rushed the world into a staggering pandemic in a few months, and a global fight against both has been intensifying. Here, we describe an analysis procedure where genome composition and its variables are related, through the genetic code to molecular mechanisms, based on understanding of RNA replication and its feedback loop from mutation to viral proteome sequence fraternity including effective sites on the replicase-transcriptase complex. Our analysis starts with primary sequence information, identity-based phylogeny based on 22,051 SARS-CoV-2 sequences, and evaluation of sequence variation patterns as mutation spectra and its 12 permutations among organized clades. All are tailored to two key mechanisms: strand-biased and function-associated mutations. Our findings are listed as follows: 1) The most dominant mutation is C-to-U permutation, whose abundant second-codon-position counts alter amino acid composition toward higher molecular weight and lower hydrophobicity, albeit assumed most slightly deleterious. 2) The second abundance group includes three negative-strand mutations (U-to-C, A-to-G, and G-to-A) and a positive-strand mutation (G-to-U) due to DNA repair mechanisms after cellular abasic events. 3) A clade-associated biased mutation trend is found attributable to elevated level of negative-sense strand synthesis. 4) Within-clade permutation variation is very informative for associating non-synonymous mutations and viral proteome changes. These findings demand a platform where emerging mutations are mapped onto mostly subtle but fast-adjusting viral proteomes and transcriptomes, to provide biological and clinical information after logical convergence for effective pharmaceutical and diagnostic applications. Such actions are in desperate need, especially in the middle of the War against COVID-19.
format Online
Article
Text
id pubmed-7875716
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-78757162021-02-11 Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades Teng, Xufei Li, Qianpeng Li, Zhao Zhang, Yuansheng Niu, Guangyi Xiao, Jingfa Yu, Jun Zhang, Zhang Song, Shuhui Genomics Proteomics Bioinformatics Original Research COVID-19 and its causative pathogen SARS-CoV-2 have rushed the world into a staggering pandemic in a few months, and a global fight against both has been intensifying. Here, we describe an analysis procedure where genome composition and its variables are related, through the genetic code to molecular mechanisms, based on understanding of RNA replication and its feedback loop from mutation to viral proteome sequence fraternity including effective sites on the replicase-transcriptase complex. Our analysis starts with primary sequence information, identity-based phylogeny based on 22,051 SARS-CoV-2 sequences, and evaluation of sequence variation patterns as mutation spectra and its 12 permutations among organized clades. All are tailored to two key mechanisms: strand-biased and function-associated mutations. Our findings are listed as follows: 1) The most dominant mutation is C-to-U permutation, whose abundant second-codon-position counts alter amino acid composition toward higher molecular weight and lower hydrophobicity, albeit assumed most slightly deleterious. 2) The second abundance group includes three negative-strand mutations (U-to-C, A-to-G, and G-to-A) and a positive-strand mutation (G-to-U) due to DNA repair mechanisms after cellular abasic events. 3) A clade-associated biased mutation trend is found attributable to elevated level of negative-sense strand synthesis. 4) Within-clade permutation variation is very informative for associating non-synonymous mutations and viral proteome changes. These findings demand a platform where emerging mutations are mapped onto mostly subtle but fast-adjusting viral proteomes and transcriptomes, to provide biological and clinical information after logical convergence for effective pharmaceutical and diagnostic applications. Such actions are in desperate need, especially in the middle of the War against COVID-19. Elsevier 2020-12 2021-02-11 /pmc/articles/PMC7875716/ /pubmed/33581339 http://dx.doi.org/10.1016/j.gpb.2020.10.003 Text en © 2021 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Research
Teng, Xufei
Li, Qianpeng
Li, Zhao
Zhang, Yuansheng
Niu, Guangyi
Xiao, Jingfa
Yu, Jun
Zhang, Zhang
Song, Shuhui
Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades
title Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades
title_full Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades
title_fullStr Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades
title_full_unstemmed Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades
title_short Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades
title_sort compositional variability and mutation spectra of monophyletic sars-cov-2 clades
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7875716/
https://www.ncbi.nlm.nih.gov/pubmed/33581339
http://dx.doi.org/10.1016/j.gpb.2020.10.003
work_keys_str_mv AT tengxufei compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades
AT liqianpeng compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades
AT lizhao compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades
AT zhangyuansheng compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades
AT niuguangyi compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades
AT xiaojingfa compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades
AT yujun compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades
AT zhangzhang compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades
AT songshuhui compositionalvariabilityandmutationspectraofmonophyleticsarscov2clades