Cargando…

Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies

BACKGROUND: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients...

Descripción completa

Detalles Bibliográficos
Autores principales: Lau, Billy T., Pavlichin, Dmitri, Hooker, Anna C., Almeda, Alison, Shin, Giwon, Chen, Jiamin, Sahoo, Malaya K., Huang, Chun Hong, Pinsky, Benjamin A., Lee, Ho Joon, Ji, Hanlee P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8054698/
https://www.ncbi.nlm.nih.gov/pubmed/33875001
http://dx.doi.org/10.1186/s13073-021-00882-2
_version_ 1783680336364830720
author Lau, Billy T.
Pavlichin, Dmitri
Hooker, Anna C.
Almeda, Alison
Shin, Giwon
Chen, Jiamin
Sahoo, Malaya K.
Huang, Chun Hong
Pinsky, Benjamin A.
Lee, Ho Joon
Ji, Hanlee P.
author_facet Lau, Billy T.
Pavlichin, Dmitri
Hooker, Anna C.
Almeda, Alison
Shin, Giwon
Chen, Jiamin
Sahoo, Malaya K.
Huang, Chun Hong
Pinsky, Benjamin A.
Lee, Ho Joon
Ji, Hanlee P.
author_sort Lau, Billy T.
collection PubMed
description BACKGROUND: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contact tracing. METHODS: Leveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies variants, and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints. RESULTS: We built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes. We delineated mutation profiles spanning common genetic fingerprints (the combination of mutations in a viral assembly) and a combination of mutations that appear in only a small number of patients. We developed a targeted sequencing assay by selecting primers from the conserved viral genome regions to flank frequent mutations. Using a cohort of 100 SARS-CoV-2 clinical samples, we identified genetic fingerprints consisting of strain-specific mutations seen across populations and de novo quasispecies mutations localized to individual infections. We compared the mutation profiles of viral samples undergoing analysis with the features of the pangenome. CONCLUSIONS: We conducted an analysis for viral mutation profiles that provide the basis of genetic fingerprints. Our study linked pangenome analysis with targeted deep sequenced SARS-CoV-2 clinical samples. We identified quasispecies mutations occurring within individual patients and determined their general prevalence when compared to over 70,000 other strains. Analysis of these genetic fingerprints may provide a way of conducting molecular contact tracing.
format Online
Article
Text
id pubmed-8054698
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-80546982021-04-20 Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies Lau, Billy T. Pavlichin, Dmitri Hooker, Anna C. Almeda, Alison Shin, Giwon Chen, Jiamin Sahoo, Malaya K. Huang, Chun Hong Pinsky, Benjamin A. Lee, Ho Joon Ji, Hanlee P. Genome Med Research BACKGROUND: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contact tracing. METHODS: Leveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies variants, and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints. RESULTS: We built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes. We delineated mutation profiles spanning common genetic fingerprints (the combination of mutations in a viral assembly) and a combination of mutations that appear in only a small number of patients. We developed a targeted sequencing assay by selecting primers from the conserved viral genome regions to flank frequent mutations. Using a cohort of 100 SARS-CoV-2 clinical samples, we identified genetic fingerprints consisting of strain-specific mutations seen across populations and de novo quasispecies mutations localized to individual infections. We compared the mutation profiles of viral samples undergoing analysis with the features of the pangenome. CONCLUSIONS: We conducted an analysis for viral mutation profiles that provide the basis of genetic fingerprints. Our study linked pangenome analysis with targeted deep sequenced SARS-CoV-2 clinical samples. We identified quasispecies mutations occurring within individual patients and determined their general prevalence when compared to over 70,000 other strains. Analysis of these genetic fingerprints may provide a way of conducting molecular contact tracing. BioMed Central 2021-04-19 /pmc/articles/PMC8054698/ /pubmed/33875001 http://dx.doi.org/10.1186/s13073-021-00882-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Lau, Billy T.
Pavlichin, Dmitri
Hooker, Anna C.
Almeda, Alison
Shin, Giwon
Chen, Jiamin
Sahoo, Malaya K.
Huang, Chun Hong
Pinsky, Benjamin A.
Lee, Ho Joon
Ji, Hanlee P.
Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies
title Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies
title_full Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies
title_fullStr Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies
title_full_unstemmed Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies
title_short Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies
title_sort profiling sars-cov-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8054698/
https://www.ncbi.nlm.nih.gov/pubmed/33875001
http://dx.doi.org/10.1186/s13073-021-00882-2
work_keys_str_mv AT laubillyt profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT pavlichindmitri profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT hookerannac profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT almedaalison profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT shingiwon profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT chenjiamin profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT sahoomalayak profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT huangchunhong profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT pinskybenjamina profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT leehojoon profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies
AT jihanleep profilingsarscov2mutationfingerprintsthatrangefromtheviralpangenometoindividualinfectionquasispecies