Cargando…

New Pathways of Mutational Change in SARS-CoV-2 Proteomes Involve Regions of Intrinsic Disorder Important for Virus Replication and Release

The massive worldwide spread of the SARS-CoV-2 virus is fueling the COVID-19 pandemic. Since the first whole-genome sequence was published in January 2020, a growing database of tens of thousands of viral genomes has been constructed. This offers opportunities to study pathways of molecular change i...

Descripción completa

Detalles Bibliográficos
Autores principales: Tomaszewski, Tre, DeVries, Ryan S, Dong, Mengyi, Bhatia, Gitanshu, Norsworthy, Miles D, Zheng, Xuying, Caetano-Anollés, Gustavo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586267/
https://www.ncbi.nlm.nih.gov/pubmed/33149541
http://dx.doi.org/10.1177/1176934320965149
_version_ 1783599960611094528
author Tomaszewski, Tre
DeVries, Ryan S
Dong, Mengyi
Bhatia, Gitanshu
Norsworthy, Miles D
Zheng, Xuying
Caetano-Anollés, Gustavo
author_facet Tomaszewski, Tre
DeVries, Ryan S
Dong, Mengyi
Bhatia, Gitanshu
Norsworthy, Miles D
Zheng, Xuying
Caetano-Anollés, Gustavo
author_sort Tomaszewski, Tre
collection PubMed
description The massive worldwide spread of the SARS-CoV-2 virus is fueling the COVID-19 pandemic. Since the first whole-genome sequence was published in January 2020, a growing database of tens of thousands of viral genomes has been constructed. This offers opportunities to study pathways of molecular change in the expanding viral population that can help identify molecular culprits of virulence and virus spread. Here we investigate the genomic accumulation of mutations at various time points of the early pandemic to identify changes in mutationally highly active genomic regions that are occurring worldwide. We used the Wuhan NC_045512.2 sequence as a reference and sampled 15 342 indexed sequences from GISAID, translating them into proteins and grouping them by month of deposition. The per-position amino acid frequencies and Shannon entropies of the coding sequences were calculated for each month, and a map of intrinsic disorder regions and binding sites was generated. The analysis revealed dominant variants, most of which were located in loop regions and on the surface of the proteins. Mutation entropy decreased between March and April of 2020 after steady increases at several sites, including the D614G mutation site of the spike (S) protein that was previously found associated with higher case fatality rates and at sites of the NSP12 polymerase and the NSP13 helicase proteins. Notable expanding mutations include R203K and G204R of the nucleocapsid (N) protein inter-domain linker region and G251V of the viroporin encoded by ORF3a between March and April. The regions spanning these mutations exhibited significant intrinsic disorder, which was enhanced and decreased by the N-protein and viroporin 3a protein mutations, respectively. These results predict an ongoing mutational shift from the spike and replication complex to other regions, especially to encoded molecules known to represent major β-interferon antagonists. The study provides valuable information for therapeutics and vaccine design, as well as insight into mutation tendencies that could facilitate preventive control.
format Online
Article
Text
id pubmed-7586267
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-75862672020-11-03 New Pathways of Mutational Change in SARS-CoV-2 Proteomes Involve Regions of Intrinsic Disorder Important for Virus Replication and Release Tomaszewski, Tre DeVries, Ryan S Dong, Mengyi Bhatia, Gitanshu Norsworthy, Miles D Zheng, Xuying Caetano-Anollés, Gustavo Evol Bioinform Online Original Research The massive worldwide spread of the SARS-CoV-2 virus is fueling the COVID-19 pandemic. Since the first whole-genome sequence was published in January 2020, a growing database of tens of thousands of viral genomes has been constructed. This offers opportunities to study pathways of molecular change in the expanding viral population that can help identify molecular culprits of virulence and virus spread. Here we investigate the genomic accumulation of mutations at various time points of the early pandemic to identify changes in mutationally highly active genomic regions that are occurring worldwide. We used the Wuhan NC_045512.2 sequence as a reference and sampled 15 342 indexed sequences from GISAID, translating them into proteins and grouping them by month of deposition. The per-position amino acid frequencies and Shannon entropies of the coding sequences were calculated for each month, and a map of intrinsic disorder regions and binding sites was generated. The analysis revealed dominant variants, most of which were located in loop regions and on the surface of the proteins. Mutation entropy decreased between March and April of 2020 after steady increases at several sites, including the D614G mutation site of the spike (S) protein that was previously found associated with higher case fatality rates and at sites of the NSP12 polymerase and the NSP13 helicase proteins. Notable expanding mutations include R203K and G204R of the nucleocapsid (N) protein inter-domain linker region and G251V of the viroporin encoded by ORF3a between March and April. The regions spanning these mutations exhibited significant intrinsic disorder, which was enhanced and decreased by the N-protein and viroporin 3a protein mutations, respectively. These results predict an ongoing mutational shift from the spike and replication complex to other regions, especially to encoded molecules known to represent major β-interferon antagonists. The study provides valuable information for therapeutics and vaccine design, as well as insight into mutation tendencies that could facilitate preventive control. SAGE Publications 2020-10-23 /pmc/articles/PMC7586267/ /pubmed/33149541 http://dx.doi.org/10.1177/1176934320965149 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Tomaszewski, Tre
DeVries, Ryan S
Dong, Mengyi
Bhatia, Gitanshu
Norsworthy, Miles D
Zheng, Xuying
Caetano-Anollés, Gustavo
New Pathways of Mutational Change in SARS-CoV-2 Proteomes Involve Regions of Intrinsic Disorder Important for Virus Replication and Release
title New Pathways of Mutational Change in SARS-CoV-2 Proteomes Involve Regions of Intrinsic Disorder Important for Virus Replication and Release
title_full New Pathways of Mutational Change in SARS-CoV-2 Proteomes Involve Regions of Intrinsic Disorder Important for Virus Replication and Release
title_fullStr New Pathways of Mutational Change in SARS-CoV-2 Proteomes Involve Regions of Intrinsic Disorder Important for Virus Replication and Release
title_full_unstemmed New Pathways of Mutational Change in SARS-CoV-2 Proteomes Involve Regions of Intrinsic Disorder Important for Virus Replication and Release
title_short New Pathways of Mutational Change in SARS-CoV-2 Proteomes Involve Regions of Intrinsic Disorder Important for Virus Replication and Release
title_sort new pathways of mutational change in sars-cov-2 proteomes involve regions of intrinsic disorder important for virus replication and release
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586267/
https://www.ncbi.nlm.nih.gov/pubmed/33149541
http://dx.doi.org/10.1177/1176934320965149
work_keys_str_mv AT tomaszewskitre newpathwaysofmutationalchangeinsarscov2proteomesinvolveregionsofintrinsicdisorderimportantforvirusreplicationandrelease
AT devriesryans newpathwaysofmutationalchangeinsarscov2proteomesinvolveregionsofintrinsicdisorderimportantforvirusreplicationandrelease
AT dongmengyi newpathwaysofmutationalchangeinsarscov2proteomesinvolveregionsofintrinsicdisorderimportantforvirusreplicationandrelease
AT bhatiagitanshu newpathwaysofmutationalchangeinsarscov2proteomesinvolveregionsofintrinsicdisorderimportantforvirusreplicationandrelease
AT norsworthymilesd newpathwaysofmutationalchangeinsarscov2proteomesinvolveregionsofintrinsicdisorderimportantforvirusreplicationandrelease
AT zhengxuying newpathwaysofmutationalchangeinsarscov2proteomesinvolveregionsofintrinsicdisorderimportantforvirusreplicationandrelease
AT caetanoanollesgustavo newpathwaysofmutationalchangeinsarscov2proteomesinvolveregionsofintrinsicdisorderimportantforvirusreplicationandrelease