Cargando…

Targeted Sanger sequencing to recover key mutations in SARS-CoV-2 variant genome assemblies produced by next-generation sequencing

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is adaptively evolving to ensure its persistence within human hosts. It is therefore necessary to continuously monitor the emergence and prevalence of novel variants that arise. Importantly, some mutations have been associated with both mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Lavanya, San, James E., Tegally, Houriiyah, Brzoska, Pius M., Anyaneji, Ugochukwu J., Wilkinson, Eduan, Clark, Lindsay, Giandhari, Jennifer, Pillay, Sureshnee, Lessells, Richard J., Martin, Darren Patrick, Furtado, Manohar, Kiran, Anmol M., de Oliveira, Tulio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176282/
https://www.ncbi.nlm.nih.gov/pubmed/35294336
http://dx.doi.org/10.1099/mgen.0.000774
_version_ 1784722631826604032
author Singh, Lavanya
San, James E.
Tegally, Houriiyah
Brzoska, Pius M.
Anyaneji, Ugochukwu J.
Wilkinson, Eduan
Clark, Lindsay
Giandhari, Jennifer
Pillay, Sureshnee
Lessells, Richard J.
Martin, Darren Patrick
Furtado, Manohar
Kiran, Anmol M.
de Oliveira, Tulio
author_facet Singh, Lavanya
San, James E.
Tegally, Houriiyah
Brzoska, Pius M.
Anyaneji, Ugochukwu J.
Wilkinson, Eduan
Clark, Lindsay
Giandhari, Jennifer
Pillay, Sureshnee
Lessells, Richard J.
Martin, Darren Patrick
Furtado, Manohar
Kiran, Anmol M.
de Oliveira, Tulio
author_sort Singh, Lavanya
collection PubMed
description Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is adaptively evolving to ensure its persistence within human hosts. It is therefore necessary to continuously monitor the emergence and prevalence of novel variants that arise. Importantly, some mutations have been associated with both molecular diagnostic failures and reduced or abrogated next-generation sequencing (NGS) read coverage in some genomic regions. Such impacts are particularly problematic when they occur in genomic regions such as those that encode the spike (S) protein, which are crucial for identifying and tracking the prevalence and dissemination dynamics of concerning viral variants. Targeted Sanger sequencing presents a fast and cost-effective means to accurately extend the coverage of whole-genome sequences. We designed a custom set of primers to amplify a 401 bp segment of the receptor-binding domain (RBD) (between positions 22698 and 23098 relative to the Wuhan-Hu-1 reference). We then designed a Sanger sequencing wet-laboratory protocol. We applied the primer set and wet-laboratory protocol to sequence 222 samples that were missing positions with key mutations K417N, E484K, and N501Y due to poor coverage after NGS sequencing. Finally, we developed SeqPatcher, a Python-based computational tool to analyse the trace files yielded by Sanger sequencing to generate consensus sequences, or take preanalysed consensus sequences in fasta format, and merge them with their corresponding whole-genome assemblies. We successfully sequenced 153 samples of 222 (69 %) using Sanger sequencing and confirmed the occurrence of key beta variant mutations (K417N, E484K, N501Y) in the S genes of 142 of 153 (93 %) samples. Additionally, one sample had the Y508F mutation and four samples the S477N. Samples with RT-PCR C (t) scores ranging from 13.85 to 37.47 (mean=25.70) could be Sanger sequenced efficiently. These results show that our method and pipeline can be used to improve the quality of whole-genome assemblies produced using NGS and can be used with any pairs of the most used NGS and Sanger sequencing platforms.
format Online
Article
Text
id pubmed-9176282
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-91762822022-06-09 Targeted Sanger sequencing to recover key mutations in SARS-CoV-2 variant genome assemblies produced by next-generation sequencing Singh, Lavanya San, James E. Tegally, Houriiyah Brzoska, Pius M. Anyaneji, Ugochukwu J. Wilkinson, Eduan Clark, Lindsay Giandhari, Jennifer Pillay, Sureshnee Lessells, Richard J. Martin, Darren Patrick Furtado, Manohar Kiran, Anmol M. de Oliveira, Tulio Microb Genom Research Articles Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is adaptively evolving to ensure its persistence within human hosts. It is therefore necessary to continuously monitor the emergence and prevalence of novel variants that arise. Importantly, some mutations have been associated with both molecular diagnostic failures and reduced or abrogated next-generation sequencing (NGS) read coverage in some genomic regions. Such impacts are particularly problematic when they occur in genomic regions such as those that encode the spike (S) protein, which are crucial for identifying and tracking the prevalence and dissemination dynamics of concerning viral variants. Targeted Sanger sequencing presents a fast and cost-effective means to accurately extend the coverage of whole-genome sequences. We designed a custom set of primers to amplify a 401 bp segment of the receptor-binding domain (RBD) (between positions 22698 and 23098 relative to the Wuhan-Hu-1 reference). We then designed a Sanger sequencing wet-laboratory protocol. We applied the primer set and wet-laboratory protocol to sequence 222 samples that were missing positions with key mutations K417N, E484K, and N501Y due to poor coverage after NGS sequencing. Finally, we developed SeqPatcher, a Python-based computational tool to analyse the trace files yielded by Sanger sequencing to generate consensus sequences, or take preanalysed consensus sequences in fasta format, and merge them with their corresponding whole-genome assemblies. We successfully sequenced 153 samples of 222 (69 %) using Sanger sequencing and confirmed the occurrence of key beta variant mutations (K417N, E484K, N501Y) in the S genes of 142 of 153 (93 %) samples. Additionally, one sample had the Y508F mutation and four samples the S477N. Samples with RT-PCR C (t) scores ranging from 13.85 to 37.47 (mean=25.70) could be Sanger sequenced efficiently. These results show that our method and pipeline can be used to improve the quality of whole-genome assemblies produced using NGS and can be used with any pairs of the most used NGS and Sanger sequencing platforms. Microbiology Society 2022-03-16 /pmc/articles/PMC9176282/ /pubmed/35294336 http://dx.doi.org/10.1099/mgen.0.000774 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.
spellingShingle Research Articles
Singh, Lavanya
San, James E.
Tegally, Houriiyah
Brzoska, Pius M.
Anyaneji, Ugochukwu J.
Wilkinson, Eduan
Clark, Lindsay
Giandhari, Jennifer
Pillay, Sureshnee
Lessells, Richard J.
Martin, Darren Patrick
Furtado, Manohar
Kiran, Anmol M.
de Oliveira, Tulio
Targeted Sanger sequencing to recover key mutations in SARS-CoV-2 variant genome assemblies produced by next-generation sequencing
title Targeted Sanger sequencing to recover key mutations in SARS-CoV-2 variant genome assemblies produced by next-generation sequencing
title_full Targeted Sanger sequencing to recover key mutations in SARS-CoV-2 variant genome assemblies produced by next-generation sequencing
title_fullStr Targeted Sanger sequencing to recover key mutations in SARS-CoV-2 variant genome assemblies produced by next-generation sequencing
title_full_unstemmed Targeted Sanger sequencing to recover key mutations in SARS-CoV-2 variant genome assemblies produced by next-generation sequencing
title_short Targeted Sanger sequencing to recover key mutations in SARS-CoV-2 variant genome assemblies produced by next-generation sequencing
title_sort targeted sanger sequencing to recover key mutations in sars-cov-2 variant genome assemblies produced by next-generation sequencing
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176282/
https://www.ncbi.nlm.nih.gov/pubmed/35294336
http://dx.doi.org/10.1099/mgen.0.000774
work_keys_str_mv AT singhlavanya targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT sanjamese targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT tegallyhouriiyah targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT brzoskapiusm targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT anyanejiugochukwuj targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT wilkinsoneduan targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT clarklindsay targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT giandharijennifer targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT pillaysureshnee targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT lessellsrichardj targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT martindarrenpatrick targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT furtadomanohar targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT kirananmolm targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing
AT deoliveiratulio targetedsangersequencingtorecoverkeymutationsinsarscov2variantgenomeassembliesproducedbynextgenerationsequencing