Cargando…
A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome
High viral transmission in the COVID-19 pandemic has enabled SARS‐CoV‐2 to acquire new mutations that may impact genome sequencing methods. The ARTIC.v3 primer pool that amplifies short amplicons in a multiplex-PCR reaction is one of the most widely used methods for sequencing the SARS-CoV-2 genome....
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8757904/ https://www.ncbi.nlm.nih.gov/pubmed/35025877 http://dx.doi.org/10.1371/journal.pone.0261014 |
_version_ | 1784632783546613760 |
---|---|
author | Arana, Carlos Liang, Chaoying Brock, Matthew Zhang, Bo Zhou, Jinchun Chen, Li Cantarel, Brandi SoRelle, Jeffrey Hooper, Lora V. Raj, Prithvi |
author_facet | Arana, Carlos Liang, Chaoying Brock, Matthew Zhang, Bo Zhou, Jinchun Chen, Li Cantarel, Brandi SoRelle, Jeffrey Hooper, Lora V. Raj, Prithvi |
author_sort | Arana, Carlos |
collection | PubMed |
description | High viral transmission in the COVID-19 pandemic has enabled SARS‐CoV‐2 to acquire new mutations that may impact genome sequencing methods. The ARTIC.v3 primer pool that amplifies short amplicons in a multiplex-PCR reaction is one of the most widely used methods for sequencing the SARS-CoV-2 genome. We observed that some genomic intervals are poorly captured with ARTIC primers. To improve the genomic coverage and variant detection across these intervals, we designed long amplicon primers and evaluated the performance of a short (ARTIC) plus long amplicon (MRL) sequencing approach. Sequencing assays were optimized on VR-1986D-ATCC RNA followed by sequencing of nasopharyngeal swab specimens from fifteen COVID-19 positive patients. ARTIC data covered 94.47% of the virus genome fraction in the positive control and patient samples. Variant analysis in the ARTIC data detected 217 mutations, including 209 single nucleotide variants (SNVs) and eight insertions & deletions. On the other hand, long-amplicon data detected 156 mutations, of which 80% were concordant with ARTIC data. Combined analysis of ARTIC + MRL data improved the genomic coverage to 97.03% and identified 214 high confidence mutations. The combined final set of 214 mutations included 203 SNVs, 8 deletions and 3 insertions. Analysis showed 26 SARS-CoV-2 lineage defining mutations including 4 known variants of concern K417N, E484K, N501Y, P618H in spike gene. Hybrid analysis identified 7 nonsynonymous and 5 synonymous mutations across the genome that were either ambiguous or not called in ARTIC data. For example, G172V mutation in the ORF3a protein and A2A mutation in Membrane protein were missed by the ARTIC assay. Thus, we show that while the short amplicon (ARTIC) assay provides good genomic coverage with high throughput, complementation of poorly captured intervals with long amplicon data can significantly improve SARS-CoV-2 genomic coverage and variant detection. |
format | Online Article Text |
id | pubmed-8757904 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-87579042022-01-14 A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome Arana, Carlos Liang, Chaoying Brock, Matthew Zhang, Bo Zhou, Jinchun Chen, Li Cantarel, Brandi SoRelle, Jeffrey Hooper, Lora V. Raj, Prithvi PLoS One Research Article High viral transmission in the COVID-19 pandemic has enabled SARS‐CoV‐2 to acquire new mutations that may impact genome sequencing methods. The ARTIC.v3 primer pool that amplifies short amplicons in a multiplex-PCR reaction is one of the most widely used methods for sequencing the SARS-CoV-2 genome. We observed that some genomic intervals are poorly captured with ARTIC primers. To improve the genomic coverage and variant detection across these intervals, we designed long amplicon primers and evaluated the performance of a short (ARTIC) plus long amplicon (MRL) sequencing approach. Sequencing assays were optimized on VR-1986D-ATCC RNA followed by sequencing of nasopharyngeal swab specimens from fifteen COVID-19 positive patients. ARTIC data covered 94.47% of the virus genome fraction in the positive control and patient samples. Variant analysis in the ARTIC data detected 217 mutations, including 209 single nucleotide variants (SNVs) and eight insertions & deletions. On the other hand, long-amplicon data detected 156 mutations, of which 80% were concordant with ARTIC data. Combined analysis of ARTIC + MRL data improved the genomic coverage to 97.03% and identified 214 high confidence mutations. The combined final set of 214 mutations included 203 SNVs, 8 deletions and 3 insertions. Analysis showed 26 SARS-CoV-2 lineage defining mutations including 4 known variants of concern K417N, E484K, N501Y, P618H in spike gene. Hybrid analysis identified 7 nonsynonymous and 5 synonymous mutations across the genome that were either ambiguous or not called in ARTIC data. For example, G172V mutation in the ORF3a protein and A2A mutation in Membrane protein were missed by the ARTIC assay. Thus, we show that while the short amplicon (ARTIC) assay provides good genomic coverage with high throughput, complementation of poorly captured intervals with long amplicon data can significantly improve SARS-CoV-2 genomic coverage and variant detection. Public Library of Science 2022-01-13 /pmc/articles/PMC8757904/ /pubmed/35025877 http://dx.doi.org/10.1371/journal.pone.0261014 Text en © 2022 Arana et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Arana, Carlos Liang, Chaoying Brock, Matthew Zhang, Bo Zhou, Jinchun Chen, Li Cantarel, Brandi SoRelle, Jeffrey Hooper, Lora V. Raj, Prithvi A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome |
title | A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome |
title_full | A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome |
title_fullStr | A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome |
title_full_unstemmed | A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome |
title_short | A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome |
title_sort | short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the sars-cov-2 genome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8757904/ https://www.ncbi.nlm.nih.gov/pubmed/35025877 http://dx.doi.org/10.1371/journal.pone.0261014 |
work_keys_str_mv | AT aranacarlos ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT liangchaoying ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT brockmatthew ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT zhangbo ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT zhoujinchun ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT chenli ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT cantarelbrandi ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT sorellejeffrey ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT hooperlorav ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT rajprithvi ashortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT aranacarlos shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT liangchaoying shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT brockmatthew shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT zhangbo shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT zhoujinchun shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT chenli shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT cantarelbrandi shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT sorellejeffrey shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT hooperlorav shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome AT rajprithvi shortpluslongampliconbasedsequencingapproachimprovesgenomiccoverageandvariantdetectioninthesarscov2genome |