Cargando…
ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell
BACKGROUND: The application of long-read sequencing using the Oxford Nanopore Technologies (ONT) MinION sequencer is getting more diverse in the medical field. Having a high sequencing error of ONT and limited throughput from a single MinION flowcell, however, limits its applicability for accurate v...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8895767/ https://www.ncbi.nlm.nih.gov/pubmed/35246132 http://dx.doi.org/10.1186/s12920-022-01190-3 |
_version_ | 1784663006500618240 |
---|---|
author | Leung, Amy Wing-Sze Leung, Henry Chi-Ming Wong, Chak-Lim Zheng, Zhen-Xian Lui, Wui-Wang Luk, Ho-Ming Lo, Ivan Fai-Man Luo, Ruibang Lam, Tak-Wah |
author_facet | Leung, Amy Wing-Sze Leung, Henry Chi-Ming Wong, Chak-Lim Zheng, Zhen-Xian Lui, Wui-Wang Luk, Ho-Ming Lo, Ivan Fai-Man Luo, Ruibang Lam, Tak-Wah |
author_sort | Leung, Amy Wing-Sze |
collection | PubMed |
description | BACKGROUND: The application of long-read sequencing using the Oxford Nanopore Technologies (ONT) MinION sequencer is getting more diverse in the medical field. Having a high sequencing error of ONT and limited throughput from a single MinION flowcell, however, limits its applicability for accurate variant detection. Medical exome sequencing (MES) targets clinically significant exon regions, allowing rapid and comprehensive screening of pathogenic variants. By applying MES with MinION sequencing, the technology can achieve a more uniform capture of the target regions, shorter turnaround time, and lower sequencing cost per sample. METHOD: We introduced a cost-effective optimized workflow, ECNano, comprising a wet-lab protocol and bioinformatics analysis, for accurate variant detection at 4800 clinically important genes and regions using a single MinION flowcell. The ECNano wet-lab protocol was optimized to perform long-read target enrichment and ONT library preparation to stably generate high-quality MES data with adequate coverage. The subsequent variant-calling workflow, Clair-ensemble, adopted a fast RNN-based variant caller, Clair, and was optimized for target enrichment data. To evaluate its performance and practicality, ECNano was tested on both reference DNA samples and patient samples. RESULTS: ECNano achieved deep on-target depth of coverage (DoC) at average > 100× and > 98% uniformity using one MinION flowcell. For accurate ONT variant calling, the generated reads sufficiently covered 98.9% of pathogenic positions listed in ClinVar, with 98.96% having at least 30× DoC. ECNano obtained an average read length of 1000 bp. The long reads of ECNano also covered the adjacent splice sites well, with 98.5% of positions having ≥ 30× DoC. Clair-ensemble achieved > 99% recall and accuracy for SNV calling. The whole workflow from wet-lab protocol to variant detection was completed within three days. CONCLUSION: We presented ECNano, an out-of-the-box workflow comprising (1) a wet-lab protocol for ONT target enrichment sequencing and (2) a downstream variant detection workflow, Clair-ensemble. The workflow is cost-effective, with a short turnaround time for high accuracy variant calling in 4800 clinically significant genes and regions using a single MinION flowcell. The long-read exon captured data has potential for further development, promoting the application of long-read sequencing in personalized disease treatment and risk prediction. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12920-022-01190-3. |
format | Online Article Text |
id | pubmed-8895767 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-88957672022-03-10 ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell Leung, Amy Wing-Sze Leung, Henry Chi-Ming Wong, Chak-Lim Zheng, Zhen-Xian Lui, Wui-Wang Luk, Ho-Ming Lo, Ivan Fai-Man Luo, Ruibang Lam, Tak-Wah BMC Med Genomics Technical Advance BACKGROUND: The application of long-read sequencing using the Oxford Nanopore Technologies (ONT) MinION sequencer is getting more diverse in the medical field. Having a high sequencing error of ONT and limited throughput from a single MinION flowcell, however, limits its applicability for accurate variant detection. Medical exome sequencing (MES) targets clinically significant exon regions, allowing rapid and comprehensive screening of pathogenic variants. By applying MES with MinION sequencing, the technology can achieve a more uniform capture of the target regions, shorter turnaround time, and lower sequencing cost per sample. METHOD: We introduced a cost-effective optimized workflow, ECNano, comprising a wet-lab protocol and bioinformatics analysis, for accurate variant detection at 4800 clinically important genes and regions using a single MinION flowcell. The ECNano wet-lab protocol was optimized to perform long-read target enrichment and ONT library preparation to stably generate high-quality MES data with adequate coverage. The subsequent variant-calling workflow, Clair-ensemble, adopted a fast RNN-based variant caller, Clair, and was optimized for target enrichment data. To evaluate its performance and practicality, ECNano was tested on both reference DNA samples and patient samples. RESULTS: ECNano achieved deep on-target depth of coverage (DoC) at average > 100× and > 98% uniformity using one MinION flowcell. For accurate ONT variant calling, the generated reads sufficiently covered 98.9% of pathogenic positions listed in ClinVar, with 98.96% having at least 30× DoC. ECNano obtained an average read length of 1000 bp. The long reads of ECNano also covered the adjacent splice sites well, with 98.5% of positions having ≥ 30× DoC. Clair-ensemble achieved > 99% recall and accuracy for SNV calling. The whole workflow from wet-lab protocol to variant detection was completed within three days. CONCLUSION: We presented ECNano, an out-of-the-box workflow comprising (1) a wet-lab protocol for ONT target enrichment sequencing and (2) a downstream variant detection workflow, Clair-ensemble. The workflow is cost-effective, with a short turnaround time for high accuracy variant calling in 4800 clinically significant genes and regions using a single MinION flowcell. The long-read exon captured data has potential for further development, promoting the application of long-read sequencing in personalized disease treatment and risk prediction. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12920-022-01190-3. BioMed Central 2022-03-04 /pmc/articles/PMC8895767/ /pubmed/35246132 http://dx.doi.org/10.1186/s12920-022-01190-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Technical Advance Leung, Amy Wing-Sze Leung, Henry Chi-Ming Wong, Chak-Lim Zheng, Zhen-Xian Lui, Wui-Wang Luk, Ho-Ming Lo, Ivan Fai-Man Luo, Ruibang Lam, Tak-Wah ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell |
title | ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell |
title_full | ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell |
title_fullStr | ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell |
title_full_unstemmed | ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell |
title_short | ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell |
title_sort | ecnano: a cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single minion flowcell |
topic | Technical Advance |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8895767/ https://www.ncbi.nlm.nih.gov/pubmed/35246132 http://dx.doi.org/10.1186/s12920-022-01190-3 |
work_keys_str_mv | AT leungamywingsze ecnanoacosteffectiveworkflowfortargetenrichmentsequencingandaccuratevariantcallingon4800clinicallysignificantgenesusingasingleminionflowcell AT leunghenrychiming ecnanoacosteffectiveworkflowfortargetenrichmentsequencingandaccuratevariantcallingon4800clinicallysignificantgenesusingasingleminionflowcell AT wongchaklim ecnanoacosteffectiveworkflowfortargetenrichmentsequencingandaccuratevariantcallingon4800clinicallysignificantgenesusingasingleminionflowcell AT zhengzhenxian ecnanoacosteffectiveworkflowfortargetenrichmentsequencingandaccuratevariantcallingon4800clinicallysignificantgenesusingasingleminionflowcell AT luiwuiwang ecnanoacosteffectiveworkflowfortargetenrichmentsequencingandaccuratevariantcallingon4800clinicallysignificantgenesusingasingleminionflowcell AT lukhoming ecnanoacosteffectiveworkflowfortargetenrichmentsequencingandaccuratevariantcallingon4800clinicallysignificantgenesusingasingleminionflowcell AT loivanfaiman ecnanoacosteffectiveworkflowfortargetenrichmentsequencingandaccuratevariantcallingon4800clinicallysignificantgenesusingasingleminionflowcell AT luoruibang ecnanoacosteffectiveworkflowfortargetenrichmentsequencingandaccuratevariantcallingon4800clinicallysignificantgenesusingasingleminionflowcell AT lamtakwah ecnanoacosteffectiveworkflowfortargetenrichmentsequencingandaccuratevariantcallingon4800clinicallysignificantgenesusingasingleminionflowcell |