Cargando…

Analysis of genomic distributions of SARS-CoV-2 reveals a dominant strain type with strong allelic associations

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causal agent of COVID 19, continues to evolve since its first emergence in December 2019. Using the complete sequences of 1,932 SARS-CoV-2 genomes, various clustering analyses consistently identified six types of the strains. Independ...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Hsin-Chou, Chen, Chun-houh, Wang, Jen-Hung, Liao, Hsiao-Chi, Yang, Chih-Ting, Chen, Chia-Wei, Lin, Yin-Chun, Kao, Chiun-How, Lu, Mei-Yeh Jade, Liao, James C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7720151/
https://www.ncbi.nlm.nih.gov/pubmed/33184173
http://dx.doi.org/10.1073/pnas.2007840117
_version_ 1783619807148507136
author Yang, Hsin-Chou
Chen, Chun-houh
Wang, Jen-Hung
Liao, Hsiao-Chi
Yang, Chih-Ting
Chen, Chia-Wei
Lin, Yin-Chun
Kao, Chiun-How
Lu, Mei-Yeh Jade
Liao, James C.
author_facet Yang, Hsin-Chou
Chen, Chun-houh
Wang, Jen-Hung
Liao, Hsiao-Chi
Yang, Chih-Ting
Chen, Chia-Wei
Lin, Yin-Chun
Kao, Chiun-How
Lu, Mei-Yeh Jade
Liao, James C.
author_sort Yang, Hsin-Chou
collection PubMed
description Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causal agent of COVID 19, continues to evolve since its first emergence in December 2019. Using the complete sequences of 1,932 SARS-CoV-2 genomes, various clustering analyses consistently identified six types of the strains. Independent of the dendrogram construction, 13 signature variations in the form of single nucleotide variations (SNVs) in protein coding regions and one SNV in the 5′ untranslated region (UTR) were identified and provided a direct interpretation for the six types (types I to VI). The six types of the strains and their underlying signature SNVs were validated in two subsequent analyses of 6,228 and 38,248 SARS-CoV-2 genomes which became available later. To date, type VI, characterized by the four signature SNVs C241T (5′UTR), C3037T (nsp3 F924F), C14408T (nsp12 P4715L), and A23403G (Spike D614G), with strong allelic associations, has become the dominant type. Since C241T is in the 5′ UTR with uncertain significance and the characteristics can be captured by the other three strongly associated SNVs, we focus on the other three. The increasing frequency of the type VI haplotype 3037T-14408T-23403G in the majority of the submitted samples in various countries suggests a possible fitness gain conferred by the type VI signature SNVs. The fact that strains missing one or two of these signature SNVs fail to persist implies possible interactions among these SNVs. Later SNVs such as G28881A, G28882A, and G28883C have emerged with strong allelic associations, forming new subtypes. This study suggests that SNVs may become an important consideration in SARS-CoV-2 classification and surveillance.
format Online
Article
Text
id pubmed-7720151
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-77201512020-12-18 Analysis of genomic distributions of SARS-CoV-2 reveals a dominant strain type with strong allelic associations Yang, Hsin-Chou Chen, Chun-houh Wang, Jen-Hung Liao, Hsiao-Chi Yang, Chih-Ting Chen, Chia-Wei Lin, Yin-Chun Kao, Chiun-How Lu, Mei-Yeh Jade Liao, James C. Proc Natl Acad Sci U S A Biological Sciences Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causal agent of COVID 19, continues to evolve since its first emergence in December 2019. Using the complete sequences of 1,932 SARS-CoV-2 genomes, various clustering analyses consistently identified six types of the strains. Independent of the dendrogram construction, 13 signature variations in the form of single nucleotide variations (SNVs) in protein coding regions and one SNV in the 5′ untranslated region (UTR) were identified and provided a direct interpretation for the six types (types I to VI). The six types of the strains and their underlying signature SNVs were validated in two subsequent analyses of 6,228 and 38,248 SARS-CoV-2 genomes which became available later. To date, type VI, characterized by the four signature SNVs C241T (5′UTR), C3037T (nsp3 F924F), C14408T (nsp12 P4715L), and A23403G (Spike D614G), with strong allelic associations, has become the dominant type. Since C241T is in the 5′ UTR with uncertain significance and the characteristics can be captured by the other three strongly associated SNVs, we focus on the other three. The increasing frequency of the type VI haplotype 3037T-14408T-23403G in the majority of the submitted samples in various countries suggests a possible fitness gain conferred by the type VI signature SNVs. The fact that strains missing one or two of these signature SNVs fail to persist implies possible interactions among these SNVs. Later SNVs such as G28881A, G28882A, and G28883C have emerged with strong allelic associations, forming new subtypes. This study suggests that SNVs may become an important consideration in SARS-CoV-2 classification and surveillance. National Academy of Sciences 2020-12-01 2020-11-12 /pmc/articles/PMC7720151/ /pubmed/33184173 http://dx.doi.org/10.1073/pnas.2007840117 Text en Copyright © 2020 the Author(s). Published by PNAS. http://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (http://creativecommons.org/licenses/by/4.0/) .
spellingShingle Biological Sciences
Yang, Hsin-Chou
Chen, Chun-houh
Wang, Jen-Hung
Liao, Hsiao-Chi
Yang, Chih-Ting
Chen, Chia-Wei
Lin, Yin-Chun
Kao, Chiun-How
Lu, Mei-Yeh Jade
Liao, James C.
Analysis of genomic distributions of SARS-CoV-2 reveals a dominant strain type with strong allelic associations
title Analysis of genomic distributions of SARS-CoV-2 reveals a dominant strain type with strong allelic associations
title_full Analysis of genomic distributions of SARS-CoV-2 reveals a dominant strain type with strong allelic associations
title_fullStr Analysis of genomic distributions of SARS-CoV-2 reveals a dominant strain type with strong allelic associations
title_full_unstemmed Analysis of genomic distributions of SARS-CoV-2 reveals a dominant strain type with strong allelic associations
title_short Analysis of genomic distributions of SARS-CoV-2 reveals a dominant strain type with strong allelic associations
title_sort analysis of genomic distributions of sars-cov-2 reveals a dominant strain type with strong allelic associations
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7720151/
https://www.ncbi.nlm.nih.gov/pubmed/33184173
http://dx.doi.org/10.1073/pnas.2007840117
work_keys_str_mv AT yanghsinchou analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations
AT chenchunhouh analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations
AT wangjenhung analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations
AT liaohsiaochi analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations
AT yangchihting analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations
AT chenchiawei analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations
AT linyinchun analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations
AT kaochiunhow analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations
AT lumeiyehjade analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations
AT liaojamesc analysisofgenomicdistributionsofsarscov2revealsadominantstraintypewithstrongallelicassociations