Cargando…
Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies
BACKGROUND: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8054698/ https://www.ncbi.nlm.nih.gov/pubmed/33875001 http://dx.doi.org/10.1186/s13073-021-00882-2 |
_version_ | 1783680336364830720 |
---|---|
author | Lau, Billy T. Pavlichin, Dmitri Hooker, Anna C. Almeda, Alison Shin, Giwon Chen, Jiamin Sahoo, Malaya K. Huang, Chun Hong Pinsky, Benjamin A. Lee, Ho Joon Ji, Hanlee P. |
author_facet | Lau, Billy T. Pavlichin, Dmitri Hooker, Anna C. Almeda, Alison Shin, Giwon Chen, Jiamin Sahoo, Malaya K. Huang, Chun Hong Pinsky, Benjamin A. Lee, Ho Joon Ji, Hanlee P. |
author_sort | Lau, Billy T. |
collection | PubMed |
description | BACKGROUND: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contact tracing. METHODS: Leveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies variants, and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints. RESULTS: We built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes. We delineated mutation profiles spanning common genetic fingerprints (the combination of mutations in a viral assembly) and a combination of mutations that appear in only a small number of patients. We developed a targeted sequencing assay by selecting primers from the conserved viral genome regions to flank frequent mutations. Using a cohort of 100 SARS-CoV-2 clinical samples, we identified genetic fingerprints consisting of strain-specific mutations seen across populations and de novo quasispecies mutations localized to individual infections. We compared the mutation profiles of viral samples undergoing analysis with the features of the pangenome. CONCLUSIONS: We conducted an analysis for viral mutation profiles that provide the basis of genetic fingerprints. Our study linked pangenome analysis with targeted deep sequenced SARS-CoV-2 clinical samples. We identified quasispecies mutations occurring within individual patients and determined their general prevalence when compared to over 70,000 other strains. Analysis of these genetic fingerprints may provide a way of conducting molecular contact tracing. |
format | Online Article Text |
id | pubmed-8054698 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-80546982021-04-20 Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies Lau, Billy T. Pavlichin, Dmitri Hooker, Anna C. Almeda, Alison Shin, Giwon Chen, Jiamin Sahoo, Malaya K. Huang, Chun Hong Pinsky, Benjamin A. Lee, Ho Joon Ji, Hanlee P. Genome Med Research BACKGROUND: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contact tracing. METHODS: Leveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies variants, and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints. RESULTS: We built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes. We delineated mutation profiles spanning common genetic fingerprints (the combination of mutations in a viral assembly) and a combination of mutations that appear in only a small number of patients. We developed a targeted sequencing assay by selecting primers from the conserved viral genome regions to flank frequent mutations. Using a cohort of 100 SARS-CoV-2 clinical samples, we identified genetic fingerprints consisting of strain-specific mutations seen across populations and de novo quasispecies mutations localized to individual infections. We compared the mutation profiles of viral samples undergoing analysis with the features of the pangenome. CONCLUSIONS: We conducted an analysis for viral mutation profiles that provide the basis of genetic fingerprints. Our study linked pangenome analysis with targeted deep sequenced SARS-CoV-2 clinical samples. We identified quasispecies mutations occurring within individual patients and determined their general prevalence when compared to over 70,000 other strains. Analysis of these genetic fingerprints may provide a way of conducting molecular contact tracing. BioMed Central 2021-04-19 /pmc/articles/PMC8054698/ /pubmed/33875001 http://dx.doi.org/10.1186/s13073-021-00882-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Lau, Billy T. Pavlichin, Dmitri Hooker, Anna C. Almeda, Alison Shin, Giwon Chen, Jiamin Sahoo, Malaya K. Huang, Chun Hong Pinsky, Benjamin A. Lee, Ho Joon Ji, Hanlee P. Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies |
title | Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies |
title_full | Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies |
title_fullStr | Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies |
title_full_unstemmed | Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies |
title_short | Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies |
title_sort | profiling sars-cov-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8054698/ https://www.ncbi.nlm.nih.gov/pubmed/33875001 http://dx.doi.org/10.1186/s13073-021-00882-2 |
work_keys_str_mv | AT laubillyt profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT pavlichindmitri profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT hookerannac profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT almedaalison profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT shingiwon profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT chenjiamin profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT sahoomalayak profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT huangchunhong profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT pinskybenjamina profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT leehojoon profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies AT jihanleep profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies |