Cargando…

A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome

High viral transmission in the COVID-19 pandemic has enabled SARS‐CoV‐2 to acquire new mutations that may impact genome sequencing methods. The ARTIC.v3 primer pool that amplifies short amplicons in a multiplex-PCR reaction is one of the most widely used methods for sequencing the SARS-CoV-2 genome....

Descripción completa

Detalles Bibliográficos
Autores principales: Arana, Carlos, Liang, Chaoying, Brock, Matthew, Zhang, Bo, Zhou, Jinchun, Chen, Li, Cantarel, Brandi, SoRelle, Jeffrey, Hooper, Lora V., Raj, Prithvi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8757904/
https://www.ncbi.nlm.nih.gov/pubmed/35025877
http://dx.doi.org/10.1371/journal.pone.0261014
_version_ 1784632783546613760
author Arana, Carlos
Liang, Chaoying
Brock, Matthew
Zhang, Bo
Zhou, Jinchun
Chen, Li
Cantarel, Brandi
SoRelle, Jeffrey
Hooper, Lora V.
Raj, Prithvi
author_facet Arana, Carlos
Liang, Chaoying
Brock, Matthew
Zhang, Bo
Zhou, Jinchun
Chen, Li
Cantarel, Brandi
SoRelle, Jeffrey
Hooper, Lora V.
Raj, Prithvi
author_sort Arana, Carlos
collection PubMed
description High viral transmission in the COVID-19 pandemic has enabled SARS‐CoV‐2 to acquire new mutations that may impact genome sequencing methods. The ARTIC.v3 primer pool that amplifies short amplicons in a multiplex-PCR reaction is one of the most widely used methods for sequencing the SARS-CoV-2 genome. We observed that some genomic intervals are poorly captured with ARTIC primers. To improve the genomic coverage and variant detection across these intervals, we designed long amplicon primers and evaluated the performance of a short (ARTIC) plus long amplicon (MRL) sequencing approach. Sequencing assays were optimized on VR-1986D-ATCC RNA followed by sequencing of nasopharyngeal swab specimens from fifteen COVID-19 positive patients. ARTIC data covered 94.47% of the virus genome fraction in the positive control and patient samples. Variant analysis in the ARTIC data detected 217 mutations, including 209 single nucleotide variants (SNVs) and eight insertions & deletions. On the other hand, long-amplicon data detected 156 mutations, of which 80% were concordant with ARTIC data. Combined analysis of ARTIC + MRL data improved the genomic coverage to 97.03% and identified 214 high confidence mutations. The combined final set of 214 mutations included 203 SNVs, 8 deletions and 3 insertions. Analysis showed 26 SARS-CoV-2 lineage defining mutations including 4 known variants of concern K417N, E484K, N501Y, P618H in spike gene. Hybrid analysis identified 7 nonsynonymous and 5 synonymous mutations across the genome that were either ambiguous or not called in ARTIC data. For example, G172V mutation in the ORF3a protein and A2A mutation in Membrane protein were missed by the ARTIC assay. Thus, we show that while the short amplicon (ARTIC) assay provides good genomic coverage with high throughput, complementation of poorly captured intervals with long amplicon data can significantly improve SARS-CoV-2 genomic coverage and variant detection.
format Online
Article
Text
id pubmed-8757904
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87579042022-01-14 A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome Arana, Carlos Liang, Chaoying Brock, Matthew Zhang, Bo Zhou, Jinchun Chen, Li Cantarel, Brandi SoRelle, Jeffrey Hooper, Lora V. Raj, Prithvi PLoS One Research Article High viral transmission in the COVID-19 pandemic has enabled SARS‐CoV‐2 to acquire new mutations that may impact genome sequencing methods. The ARTIC.v3 primer pool that amplifies short amplicons in a multiplex-PCR reaction is one of the most widely used methods for sequencing the SARS-CoV-2 genome. We observed that some genomic intervals are poorly captured with ARTIC primers. To improve the genomic coverage and variant detection across these intervals, we designed long amplicon primers and evaluated the performance of a short (ARTIC) plus long amplicon (MRL) sequencing approach. Sequencing assays were optimized on VR-1986D-ATCC RNA followed by sequencing of nasopharyngeal swab specimens from fifteen COVID-19 positive patients. ARTIC data covered 94.47% of the virus genome fraction in the positive control and patient samples. Variant analysis in the ARTIC data detected 217 mutations, including 209 single nucleotide variants (SNVs) and eight insertions & deletions. On the other hand, long-amplicon data detected 156 mutations, of which 80% were concordant with ARTIC data. Combined analysis of ARTIC + MRL data improved the genomic coverage to 97.03% and identified 214 high confidence mutations. The combined final set of 214 mutations included 203 SNVs, 8 deletions and 3 insertions. Analysis showed 26 SARS-CoV-2 lineage defining mutations including 4 known variants of concern K417N, E484K, N501Y, P618H in spike gene. Hybrid analysis identified 7 nonsynonymous and 5 synonymous mutations across the genome that were either ambiguous or not called in ARTIC data. For example, G172V mutation in the ORF3a protein and A2A mutation in Membrane protein were missed by the ARTIC assay. Thus, we show that while the short amplicon (ARTIC) assay provides good genomic coverage with high throughput, complementation of poorly captured intervals with long amplicon data can significantly improve SARS-CoV-2 genomic coverage and variant detection. Public Library of Science 2022-01-13 /pmc/articles/PMC8757904/ /pubmed/35025877 http://dx.doi.org/10.1371/journal.pone.0261014 Text en © 2022 Arana et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Arana, Carlos
Liang, Chaoying
Brock, Matthew
Zhang, Bo
Zhou, Jinchun
Chen, Li
Cantarel, Brandi
SoRelle, Jeffrey
Hooper, Lora V.
Raj, Prithvi
A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome
title A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome
title_full A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome
title_fullStr A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome
title_full_unstemmed A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome
title_short A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome
title_sort short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the sars-cov-2 genome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8757904/
https://www.ncbi.nlm.nih.gov/pubmed/35025877
http://dx.doi.org/10.1371/journal.pone.0261014
work_keys_str_mv AT aranacarlos ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT liangchaoying ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT brockmatthew ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT zhangbo ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT zhoujinchun ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT chenli ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT cantarelbrandi ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT sorellejeffrey ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT hooperlorav ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT rajprithvi ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT aranacarlos shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT liangchaoying shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT brockmatthew shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT zhangbo shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT zhoujinchun shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT chenli shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT cantarelbrandi shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT sorellejeffrey shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT hooperlorav shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome
AT rajprithvi shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome