Cargando…

Emergence of genomic diversity and recurrent mutations in SARS-CoV-2

SARS-CoV-2 is a SARS-like coronavirus of likely zoonotic origin first identified in December 2019 in Wuhan, the capital of China's Hubei province. The virus has since spread globally, resulting in the currently ongoing COVID-19 pandemic. The first whole genome sequence was published on January...

Descripción completa

Detalles Bibliográficos
Autores principales: van Dorp, Lucy, Acman, Mislav, Richard, Damien, Shaw, Liam P., Ford, Charlotte E., Ormond, Louise, Owen, Christopher J., Pang, Juanita, Tan, Cedric C.S., Boshier, Florencia A.T., Ortiz, Arturo Torres, Balloux, François
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7199730/
https://www.ncbi.nlm.nih.gov/pubmed/32387564
http://dx.doi.org/10.1016/j.meegid.2020.104351
_version_ 1783529204716929024
author van Dorp, Lucy
Acman, Mislav
Richard, Damien
Shaw, Liam P.
Ford, Charlotte E.
Ormond, Louise
Owen, Christopher J.
Pang, Juanita
Tan, Cedric C.S.
Boshier, Florencia A.T.
Ortiz, Arturo Torres
Balloux, François
author_facet van Dorp, Lucy
Acman, Mislav
Richard, Damien
Shaw, Liam P.
Ford, Charlotte E.
Ormond, Louise
Owen, Christopher J.
Pang, Juanita
Tan, Cedric C.S.
Boshier, Florencia A.T.
Ortiz, Arturo Torres
Balloux, François
author_sort van Dorp, Lucy
collection PubMed
description SARS-CoV-2 is a SARS-like coronavirus of likely zoonotic origin first identified in December 2019 in Wuhan, the capital of China's Hubei province. The virus has since spread globally, resulting in the currently ongoing COVID-19 pandemic. The first whole genome sequence was published on January 5 2020, and thousands of genomes have been sequenced since this date. This resource allows unprecedented insights into the past demography of SARS-CoV-2 but also monitoring of how the virus is adapting to its novel human host, providing information to direct drug and vaccine design. We curated a dataset of 7666 public genome assemblies and analysed the emergence of genomic diversity over time. Our results are in line with previous estimates and point to all sequences sharing a common ancestor towards the end of 2019, supporting this as the period when SARS-CoV-2 jumped into its human host. Due to extensive transmission, the genetic diversity of the virus in several countries recapitulates a large fraction of its worldwide genetic diversity. We identify regions of the SARS-CoV-2 genome that have remained largely invariant to date, and others that have already accumulated diversity. By focusing on mutations which have emerged independently multiple times (homoplasies), we identify 198 filtered recurrent mutations in the SARS-CoV-2 genome. Nearly 80% of the recurrent mutations produced non-synonymous changes at the protein level, suggesting possible ongoing adaptation of SARS-CoV-2. Three sites in Orf1ab in the regions encoding Nsp6, Nsp11, Nsp13, and one in the Spike protein are characterised by a particularly large number of recurrent mutations (>15 events) which may signpost convergent evolution and are of particular interest in the context of adaptation of SARS-CoV-2 to the human host. We additionally provide an interactive user-friendly web-application to query the alignment of the 7666 SARS-CoV-2 genomes.
format Online
Article
Text
id pubmed-7199730
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-71997302020-05-06 Emergence of genomic diversity and recurrent mutations in SARS-CoV-2 van Dorp, Lucy Acman, Mislav Richard, Damien Shaw, Liam P. Ford, Charlotte E. Ormond, Louise Owen, Christopher J. Pang, Juanita Tan, Cedric C.S. Boshier, Florencia A.T. Ortiz, Arturo Torres Balloux, François Infect Genet Evol Article SARS-CoV-2 is a SARS-like coronavirus of likely zoonotic origin first identified in December 2019 in Wuhan, the capital of China's Hubei province. The virus has since spread globally, resulting in the currently ongoing COVID-19 pandemic. The first whole genome sequence was published on January 5 2020, and thousands of genomes have been sequenced since this date. This resource allows unprecedented insights into the past demography of SARS-CoV-2 but also monitoring of how the virus is adapting to its novel human host, providing information to direct drug and vaccine design. We curated a dataset of 7666 public genome assemblies and analysed the emergence of genomic diversity over time. Our results are in line with previous estimates and point to all sequences sharing a common ancestor towards the end of 2019, supporting this as the period when SARS-CoV-2 jumped into its human host. Due to extensive transmission, the genetic diversity of the virus in several countries recapitulates a large fraction of its worldwide genetic diversity. We identify regions of the SARS-CoV-2 genome that have remained largely invariant to date, and others that have already accumulated diversity. By focusing on mutations which have emerged independently multiple times (homoplasies), we identify 198 filtered recurrent mutations in the SARS-CoV-2 genome. Nearly 80% of the recurrent mutations produced non-synonymous changes at the protein level, suggesting possible ongoing adaptation of SARS-CoV-2. Three sites in Orf1ab in the regions encoding Nsp6, Nsp11, Nsp13, and one in the Spike protein are characterised by a particularly large number of recurrent mutations (>15 events) which may signpost convergent evolution and are of particular interest in the context of adaptation of SARS-CoV-2 to the human host. We additionally provide an interactive user-friendly web-application to query the alignment of the 7666 SARS-CoV-2 genomes. Elsevier B.V. 2020-09 2020-05-05 /pmc/articles/PMC7199730/ /pubmed/32387564 http://dx.doi.org/10.1016/j.meegid.2020.104351 Text en © 2020 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
van Dorp, Lucy
Acman, Mislav
Richard, Damien
Shaw, Liam P.
Ford, Charlotte E.
Ormond, Louise
Owen, Christopher J.
Pang, Juanita
Tan, Cedric C.S.
Boshier, Florencia A.T.
Ortiz, Arturo Torres
Balloux, François
Emergence of genomic diversity and recurrent mutations in SARS-CoV-2
title Emergence of genomic diversity and recurrent mutations in SARS-CoV-2
title_full Emergence of genomic diversity and recurrent mutations in SARS-CoV-2
title_fullStr Emergence of genomic diversity and recurrent mutations in SARS-CoV-2
title_full_unstemmed Emergence of genomic diversity and recurrent mutations in SARS-CoV-2
title_short Emergence of genomic diversity and recurrent mutations in SARS-CoV-2
title_sort emergence of genomic diversity and recurrent mutations in sars-cov-2
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7199730/
https://www.ncbi.nlm.nih.gov/pubmed/32387564
http://dx.doi.org/10.1016/j.meegid.2020.104351
work_keys_str_mv AT vandorplucy emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT acmanmislav emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT richarddamien emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT shawliamp emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT fordcharlottee emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT ormondlouise emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT owenchristopherj emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT pangjuanita emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT tancedriccs emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT boshierflorenciaat emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT ortizarturotorres emergenceofgenomicdiversityandrecurrentmutationsinsarscov2
AT ballouxfrancois emergenceofgenomicdiversityandrecurrentmutationsinsarscov2