The substitution spectra of coronavirus genomes
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has triggered an unprecedented international effort to sequence complete viral genomes. We leveraged this wealth of information to characterize the substitution spectrum of SARS-CoV-2 and to compare it with those of other huma...
Main authors: | Forni, Diego; Cagliani, Rachele; Pontremoli, Chiara; Clerici, Mario; Sironi, Manuela |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2021 |
Subjects: | Problem Solving Protocol |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8499949/ https://www.ncbi.nlm.nih.gov/pubmed/34518866 http://dx.doi.org/10.1093/bib/bbab382 |
_version_ | 1784580376588451840 |
---|---|
author | Forni, Diego Cagliani, Rachele Pontremoli, Chiara Clerici, Mario Sironi, Manuela |
author_sort | Forni, Diego |
collection | PubMed |
description | The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has triggered an unprecedented international effort to sequence complete viral genomes. We leveraged this wealth of information to characterize the substitution spectrum of SARS-CoV-2 and to compare it with those of other human and animal coronaviruses. We show that, once nucleotide composition is taken into account, human and most animal coronaviruses display a mutation spectrum dominated by C to U and G to U substitutions, a feature that is not shared by other positive-sense RNA viruses. However, the proportions of C to U and G to U substitutions tend to decrease as divergence increases, suggesting that, whatever their origin, a proportion of these changes is subsequently eliminated by purifying selection. Analysis of the sequence context of C to U substitutions showed little evidence of apolipoprotein B mRNA editing catalytic polypeptide-like (APOBEC)-mediated editing and such contexts were similar for SARS-CoV-2 and Middle East respiratory syndrome coronavirus sampled from different hosts, despite different repertoires of APOBEC3 proteins in distinct species. Conversely, we found evidence that C to U and G to U changes affect CpG dinucleotides at a frequency higher than expected. Whereas this suggests ongoing selective reduction of CpGs, this effect alone cannot account for the substitution spectra. Finally, we show that, during the first months of SARS-CoV-2 pandemic spread, the frequency of both G to U and C to U substitutions increased. Our data suggest that the substitution spectrum of SARS-CoV-2 is determined by an interplay of factors, including intrinsic biases of the replication process, avoidance of CpG dinucleotides and other constraints exerted by the new host. |
format | Online Article Text |
id | pubmed-8499949 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-8499949 2021-10-08 The substitution spectra of coronavirus genomes Forni, Diego Cagliani, Rachele Pontremoli, Chiara Clerici, Mario Sironi, Manuela Brief Bioinform Problem Solving Protocol The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has triggered an unprecedented international effort to sequence complete viral genomes. We leveraged this wealth of information to characterize the substitution spectrum of SARS-CoV-2 and to compare it with those of other human and animal coronaviruses. We show that, once nucleotide composition is taken into account, human and most animal coronaviruses display a mutation spectrum dominated by C to U and G to U substitutions, a feature that is not shared by other positive-sense RNA viruses. However, the proportions of C to U and G to U substitutions tend to decrease as divergence increases, suggesting that, whatever their origin, a proportion of these changes is subsequently eliminated by purifying selection. Analysis of the sequence context of C to U substitutions showed little evidence of apolipoprotein B mRNA editing catalytic polypeptide-like (APOBEC)-mediated editing and such contexts were similar for SARS-CoV-2 and Middle East respiratory syndrome coronavirus sampled from different hosts, despite different repertoires of APOBEC3 proteins in distinct species. Conversely, we found evidence that C to U and G to U changes affect CpG dinucleotides at a frequency higher than expected. Whereas this suggests ongoing selective reduction of CpGs, this effect alone cannot account for the substitution spectra. Finally, we show that, during the first months of SARS-CoV-2 pandemic spread, the frequency of both G to U and C to U substitutions increased. Our data suggest that the substitution spectrum of SARS-CoV-2 is determined by an interplay of factors, including intrinsic biases of the replication process, avoidance of CpG dinucleotides and other constraints exerted by the new host. Oxford University Press 2021-09-13 /pmc/articles/PMC8499949/ /pubmed/34518866 http://dx.doi.org/10.1093/bib/bbab382 Text en © The Author(s) 2021 Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Problem Solving Protocol Forni, Diego Cagliani, Rachele Pontremoli, Chiara Clerici, Mario Sironi, Manuela The substitution spectra of coronavirus genomes |
title | The substitution spectra of coronavirus genomes |
title_sort | substitution spectra of coronavirus genomes |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8499949/ https://www.ncbi.nlm.nih.gov/pubmed/34518866 http://dx.doi.org/10.1093/bib/bbab382 |
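
The description above refers to a substitution spectrum computed "once nucleotide composition is taken into account". The record does not give the authors' procedure, so the following is only a minimal sketch of one way such a normalization could be done, assuming a pair of already aligned RNA sequences (an inferred ancestor and a descendant). The function name `substitution_spectrum` and the input format are hypothetical and are not taken from the article.

```python
from collections import Counter

BASES = "ACGU"  # RNA alphabet; gaps and ambiguity codes are skipped


def substitution_spectrum(ancestor, descendant):
    """Return the proportion of each substitution type, after dividing
    each raw count by the number of sites carrying the ancestral base.

    Illustrative assumption: both inputs are aligned, equal-length strings.
    """
    if len(ancestor) != len(descendant):
        raise ValueError("sequences must be aligned to the same length")

    base_counts = Counter()  # how often each base occurs in the ancestor
    sub_counts = Counter()   # raw counts per (from, to) substitution type

    for a, d in zip(ancestor.upper(), descendant.upper()):
        if a not in BASES or d not in BASES:
            continue  # ignore gaps and ambiguous positions
        base_counts[a] += 1
        if a != d:
            sub_counts[(a, d)] += 1

    # Normalize by composition: substitutions per available ancestral base.
    rates = {f"{a}>{d}": n / base_counts[a] for (a, d), n in sub_counts.items()}

    # Rescale so the spectrum sums to 1 (empty dict if nothing changed).
    total = sum(rates.values())
    return {k: v / total for k, v in rates.items()} if total else {}


if __name__ == "__main__":
    # Toy sequences, invented purely for illustration.
    print(substitution_spectrum("CCCCGGAU", "UCCCGUAU"))
```

In the toy input, one of four C sites and one of two G sites change to U, so after normalization G>U accounts for a larger share of the spectrum than C>U even though the raw counts are equal. This is the kind of effect that correcting for nucleotide composition is meant to expose.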