Cargando…

The substitution spectra of coronavirus genomes

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has triggered an unprecedented international effort to sequence complete viral genomes. We leveraged this wealth of information to characterize the substitution spectrum of SARS-CoV-2 and to compare it with those of other huma...

Descripción completa

Detalles Bibliográficos
Autores principales: Forni, Diego, Cagliani, Rachele, Pontremoli, Chiara, Clerici, Mario, Sironi, Manuela
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8499949/
https://www.ncbi.nlm.nih.gov/pubmed/34518866
http://dx.doi.org/10.1093/bib/bbab382
_version_ 1784580376588451840
author Forni, Diego
Cagliani, Rachele
Pontremoli, Chiara
Clerici, Mario
Sironi, Manuela
author_facet Forni, Diego
Cagliani, Rachele
Pontremoli, Chiara
Clerici, Mario
Sironi, Manuela
author_sort Forni, Diego
collection PubMed
description The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has triggered an unprecedented international effort to sequence complete viral genomes. We leveraged this wealth of information to characterize the substitution spectrum of SARS-CoV-2 and to compare it with those of other human and animal coronaviruses. We show that, once nucleotide composition is taken into account, human and most animal coronaviruses display a mutation spectrum dominated by C to U and G to U substitutions, a feature that is not shared by other positive-sense RNA viruses. However, the proportions of C to U and G to U substitutions tend to decrease as divergence increases, suggesting that, whatever their origin, a proportion of these changes is subsequently eliminated by purifying selection. Analysis of the sequence context of C to U substitutions showed little evidence of apolipoprotein B mRNA editing catalytic polypeptide-like (APOBEC)-mediated editing and such contexts were similar for SARS-CoV-2 and Middle East respiratory syndrome coronavirus sampled from different hosts, despite different repertoires of APOBEC3 proteins in distinct species. Conversely, we found evidence that C to U and G to U changes affect CpG dinucleotides at a frequency higher than expected. Whereas this suggests ongoing selective reduction of CpGs, this effect alone cannot account for the substitution spectra. Finally, we show that, during the first months of SARS-CoV-2 pandemic spread, the frequency of both G to U and C to U substitutions increased. Our data suggest that the substitution spectrum of SARS-CoV-2 is determined by an interplay of factors, including intrinsic biases of the replication process, avoidance of CpG dinucleotides and other constraints exerted by the new host.
format Online
Article
Text
id pubmed-8499949
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84999492021-10-08 The substitution spectra of coronavirus genomes Forni, Diego Cagliani, Rachele Pontremoli, Chiara Clerici, Mario Sironi, Manuela Brief Bioinform Problem Solving Protocol The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has triggered an unprecedented international effort to sequence complete viral genomes. We leveraged this wealth of information to characterize the substitution spectrum of SARS-CoV-2 and to compare it with those of other human and animal coronaviruses. We show that, once nucleotide composition is taken into account, human and most animal coronaviruses display a mutation spectrum dominated by C to U and G to U substitutions, a feature that is not shared by other positive-sense RNA viruses. However, the proportions of C to U and G to U substitutions tend to decrease as divergence increases, suggesting that, whatever their origin, a proportion of these changes is subsequently eliminated by purifying selection. Analysis of the sequence context of C to U substitutions showed little evidence of apolipoprotein B mRNA editing catalytic polypeptide-like (APOBEC)-mediated editing and such contexts were similar for SARS-CoV-2 and Middle East respiratory syndrome coronavirus sampled from different hosts, despite different repertoires of APOBEC3 proteins in distinct species. Conversely, we found evidence that C to U and G to U changes affect CpG dinucleotides at a frequency higher than expected. Whereas this suggests ongoing selective reduction of CpGs, this effect alone cannot account for the substitution spectra. Finally, we show that, during the first months of SARS-CoV-2 pandemic spread, the frequency of both G to U and C to U substitutions increased. Our data suggest that the substitution spectrum of SARS-CoV-2 is determined by an interplay of factors, including intrinsic biases of the replication process, avoidance of CpG dinucleotides and other constraints exerted by the new host. Oxford University Press 2021-09-13 /pmc/articles/PMC8499949/ /pubmed/34518866 http://dx.doi.org/10.1093/bib/bbab382 Text en © The Author(s) 2021 Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Forni, Diego
Cagliani, Rachele
Pontremoli, Chiara
Clerici, Mario
Sironi, Manuela
The substitution spectra of coronavirus genomes
title The substitution spectra of coronavirus genomes
title_full The substitution spectra of coronavirus genomes
title_fullStr The substitution spectra of coronavirus genomes
title_full_unstemmed The substitution spectra of coronavirus genomes
title_short The substitution spectra of coronavirus genomes
title_sort substitution spectra of coronavirus genomes
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8499949/
https://www.ncbi.nlm.nih.gov/pubmed/34518866
http://dx.doi.org/10.1093/bib/bbab382
work_keys_str_mv AT fornidiego thesubstitutionspectraofcoronavirusgenomes
AT caglianirachele thesubstitutionspectraofcoronavirusgenomes
AT pontremolichiara thesubstitutionspectraofcoronavirusgenomes
AT clericimario thesubstitutionspectraofcoronavirusgenomes
AT sironimanuela thesubstitutionspectraofcoronavirusgenomes
AT fornidiego substitutionspectraofcoronavirusgenomes
AT caglianirachele substitutionspectraofcoronavirusgenomes
AT pontremolichiara substitutionspectraofcoronavirusgenomes
AT clericimario substitutionspectraofcoronavirusgenomes
AT sironimanuela substitutionspectraofcoronavirusgenomes