Cargando…

Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models

BACKGROUND: A mechanistic understanding of the spread of SARS-CoV-2 and diligent tracking of ongoing mutagenesis are of key importance to plan robust strategies for confining its transmission. Large numbers of available sequences and their dates of transmission provide an unprecedented opportunity t...

Descripción completa

Detalles Bibliográficos
Autores principales: Garvin, Michael R., T. Prates, Erica, Pavicic, Mirko, Jones, Piet, Amos, B. Kirtley, Geiger, Armin, Shah, Manesh B., Streich, Jared, Felipe Machado Gazolla, Joao Gabriel, Kainer, David, Cliff, Ashley, Romero, Jonathon, Keith, Nathan, Brown, James B., Jacobson, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7756312/
https://www.ncbi.nlm.nih.gov/pubmed/33357233
http://dx.doi.org/10.1186/s13059-020-02191-0
_version_ 1783626513119182848
author Garvin, Michael R.
T. Prates, Erica
Pavicic, Mirko
Jones, Piet
Amos, B. Kirtley
Geiger, Armin
Shah, Manesh B.
Streich, Jared
Felipe Machado Gazolla, Joao Gabriel
Kainer, David
Cliff, Ashley
Romero, Jonathon
Keith, Nathan
Brown, James B.
Jacobson, Daniel
author_facet Garvin, Michael R.
T. Prates, Erica
Pavicic, Mirko
Jones, Piet
Amos, B. Kirtley
Geiger, Armin
Shah, Manesh B.
Streich, Jared
Felipe Machado Gazolla, Joao Gabriel
Kainer, David
Cliff, Ashley
Romero, Jonathon
Keith, Nathan
Brown, James B.
Jacobson, Daniel
author_sort Garvin, Michael R.
collection PubMed
description BACKGROUND: A mechanistic understanding of the spread of SARS-CoV-2 and diligent tracking of ongoing mutagenesis are of key importance to plan robust strategies for confining its transmission. Large numbers of available sequences and their dates of transmission provide an unprecedented opportunity to analyze evolutionary adaptation in novel ways. Addition of high-resolution structural information can reveal the functional basis of these processes at the molecular level. Integrated systems biology-directed analyses of these data layers afford valuable insights to build a global understanding of the COVID-19 pandemic. RESULTS: Here we identify globally distributed haplotypes from 15,789 SARS-CoV-2 genomes and model their success based on their duration, dispersal, and frequency in the host population. Our models identify mutations that are likely compensatory adaptive changes that allowed for rapid expansion of the virus. Functional predictions from structural analyses indicate that, contrary to previous reports, the Asp(614)Gly mutation in the spike glycoprotein (S) likely reduced transmission and the subsequent Pro(323)Leu mutation in the RNA-dependent RNA polymerase led to the precipitous spread of the virus. Our model also suggests that two mutations in the nsp13 helicase allowed for the adaptation of the virus to the Pacific Northwest of the USA. Finally, our explainable artificial intelligence algorithm identified a mutational hotspot in the sequence of S that also displays a signature of positive selection and may have implications for tissue or cell-specific expression of the virus. CONCLUSIONS: These results provide valuable insights for the development of drugs and surveillance strategies to combat the current and future pandemics.
format Online
Article
Text
id pubmed-7756312
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77563122020-12-23 Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models Garvin, Michael R. T. Prates, Erica Pavicic, Mirko Jones, Piet Amos, B. Kirtley Geiger, Armin Shah, Manesh B. Streich, Jared Felipe Machado Gazolla, Joao Gabriel Kainer, David Cliff, Ashley Romero, Jonathon Keith, Nathan Brown, James B. Jacobson, Daniel Genome Biol Research BACKGROUND: A mechanistic understanding of the spread of SARS-CoV-2 and diligent tracking of ongoing mutagenesis are of key importance to plan robust strategies for confining its transmission. Large numbers of available sequences and their dates of transmission provide an unprecedented opportunity to analyze evolutionary adaptation in novel ways. Addition of high-resolution structural information can reveal the functional basis of these processes at the molecular level. Integrated systems biology-directed analyses of these data layers afford valuable insights to build a global understanding of the COVID-19 pandemic. RESULTS: Here we identify globally distributed haplotypes from 15,789 SARS-CoV-2 genomes and model their success based on their duration, dispersal, and frequency in the host population. Our models identify mutations that are likely compensatory adaptive changes that allowed for rapid expansion of the virus. Functional predictions from structural analyses indicate that, contrary to previous reports, the Asp(614)Gly mutation in the spike glycoprotein (S) likely reduced transmission and the subsequent Pro(323)Leu mutation in the RNA-dependent RNA polymerase led to the precipitous spread of the virus. Our model also suggests that two mutations in the nsp13 helicase allowed for the adaptation of the virus to the Pacific Northwest of the USA. Finally, our explainable artificial intelligence algorithm identified a mutational hotspot in the sequence of S that also displays a signature of positive selection and may have implications for tissue or cell-specific expression of the virus. CONCLUSIONS: These results provide valuable insights for the development of drugs and surveillance strategies to combat the current and future pandemics. BioMed Central 2020-12-23 /pmc/articles/PMC7756312/ /pubmed/33357233 http://dx.doi.org/10.1186/s13059-020-02191-0 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Garvin, Michael R.
T. Prates, Erica
Pavicic, Mirko
Jones, Piet
Amos, B. Kirtley
Geiger, Armin
Shah, Manesh B.
Streich, Jared
Felipe Machado Gazolla, Joao Gabriel
Kainer, David
Cliff, Ashley
Romero, Jonathon
Keith, Nathan
Brown, James B.
Jacobson, Daniel
Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models
title Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models
title_full Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models
title_fullStr Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models
title_full_unstemmed Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models
title_short Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models
title_sort potentially adaptive sars-cov-2 mutations discovered with novel spatiotemporal and explainable ai models
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7756312/
https://www.ncbi.nlm.nih.gov/pubmed/33357233
http://dx.doi.org/10.1186/s13059-020-02191-0
work_keys_str_mv AT garvinmichaelr potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT tprateserica potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT pavicicmirko potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT jonespiet potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT amosbkirtley potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT geigerarmin potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT shahmaneshb potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT streichjared potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT felipemachadogazollajoaogabriel potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT kainerdavid potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT cliffashley potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT romerojonathon potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT keithnathan potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT brownjamesb potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels
AT jacobsondaniel potentiallyadaptivesarscov2mutationsdiscoveredwithnovelspatiotemporalandexplainableaimodels