
Worldwide SARS-CoV-2 haplotype distribution in early pandemic

The world is experiencing one of the most severe viral outbreaks of the last few years, the pandemic infection by SARS-CoV-2, the causative agent of COVID-19. As of December 10th, 2021, the virus has spread worldwide, with a total of more than 267 million confirmed cases (four tim...


Bibliographic Details
Main authors: Cairo, Andrea, Iorio, Marilena V., Spena, Silvia, Tagliabue, Elda, Peyvandi, Flora
Format: Online Article Text
Language: English
Published: Public Library of Science 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8849502/
https://www.ncbi.nlm.nih.gov/pubmed/35171928
http://dx.doi.org/10.1371/journal.pone.0263705
_version_ 1784652480109346816
author Cairo, Andrea
Iorio, Marilena V.
Spena, Silvia
Tagliabue, Elda
Peyvandi, Flora
author_facet Cairo, Andrea
Iorio, Marilena V.
Spena, Silvia
Tagliabue, Elda
Peyvandi, Flora
author_sort Cairo, Andrea
collection PubMed
description The world is experiencing one of the most severe viral outbreaks of the last few years, the pandemic infection by SARS-CoV-2, the causative agent of COVID-19. As of December 10th, 2021, the virus has spread worldwide, with a total of more than 267 million confirmed cases (four times more in the last year) and more than 5 million deaths. A great effort has been undertaken to molecularly characterize the virus and to track the spread of different variants across the globe, with the aim of understanding their potential effects on transmission capability and fatality rates. Here we focus on the genomic diversity and distribution of the virus in the early stages of the pandemic, to better characterize the origin of COVID-19 and to define the geographical and temporal evolution of genetic clades. By performing a comparative analysis of 75401 reported SARS-CoV-2 sequences (as of December 2020), using as reference the first viral sequence reported in Wuhan in December 2019, we described the existence of 26538 genetic variants, the most frequent of which cluster into four major clades characterized by a specific geographical distribution. Notably, we found the most frequent variant, the previously reported missense p.Asp614Gly in the S protein, as a single mutation in only three patients, whereas in the large majority of cases it occurs together with three other variants, suggesting high linkage and that this variant alone might not provide a significant selective advantage to the virus. Moreover, we evaluated the presence and distribution in our dataset of the mutations characterizing the so-called “British variant”, identified at the beginning of 2021, and observed that 9 out of 17 are present in only a few sequences, but never in linkage with each other, suggesting a synergistic effect in this new viral strain.
In summary, this is a large-scale analysis of deposited SARS-CoV-2 sequences, with a particular focus on the geographical and temporal evolution of genetic clades in the early phase of the COVID-19 pandemic.
format Online
Article
Text
id pubmed-8849502
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-8849502 2022-02-17 Worldwide SARS-CoV-2 haplotype distribution in early pandemic Cairo, Andrea Iorio, Marilena V. Spena, Silvia Tagliabue, Elda Peyvandi, Flora PLoS One Research Article Public Library of Science 2022-02-16 /pmc/articles/PMC8849502/ /pubmed/35171928 http://dx.doi.org/10.1371/journal.pone.0263705 Text en © 2022 Cairo et al https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cairo, Andrea
Iorio, Marilena V.
Spena, Silvia
Tagliabue, Elda
Peyvandi, Flora
Worldwide SARS-CoV-2 haplotype distribution in early pandemic
title Worldwide SARS-CoV-2 haplotype distribution in early pandemic
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8849502/
https://www.ncbi.nlm.nih.gov/pubmed/35171928
http://dx.doi.org/10.1371/journal.pone.0263705