ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell

BACKGROUND: The application of long-read sequencing using the Oxford Nanopore Technologies (ONT) MinION sequencer is becoming increasingly diverse in the medical field. However, the high sequencing error rate of ONT and the limited throughput of a single MinION flowcell limit its applicability for accurate v...

Bibliographic Details
Main Authors: Leung, Amy Wing-Sze, Leung, Henry Chi-Ming, Wong, Chak-Lim, Zheng, Zhen-Xian, Lui, Wui-Wang, Luk, Ho-Ming, Lo, Ivan Fai-Man, Luo, Ruibang, Lam, Tak-Wah
Format: Online Article Text
Language: English
Published: BioMed Central 2022
Subjects: Technical Advance
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8895767/
https://www.ncbi.nlm.nih.gov/pubmed/35246132
http://dx.doi.org/10.1186/s12920-022-01190-3
author Leung, Amy Wing-Sze
Leung, Henry Chi-Ming
Wong, Chak-Lim
Zheng, Zhen-Xian
Lui, Wui-Wang
Luk, Ho-Ming
Lo, Ivan Fai-Man
Luo, Ruibang
Lam, Tak-Wah
author_sort Leung, Amy Wing-Sze
collection PubMed
description BACKGROUND: The application of long-read sequencing using the Oxford Nanopore Technologies (ONT) MinION sequencer is becoming increasingly diverse in the medical field. However, the high sequencing error rate of ONT and the limited throughput of a single MinION flowcell limit its applicability for accurate variant detection. Medical exome sequencing (MES) targets clinically significant exon regions, allowing rapid and comprehensive screening of pathogenic variants. By applying MES with MinION sequencing, the technology can achieve a more uniform capture of the target regions, a shorter turnaround time, and a lower sequencing cost per sample.
METHOD: We introduced a cost-effective, optimized workflow, ECNano, comprising a wet-lab protocol and bioinformatics analysis, for accurate variant detection at 4800 clinically important genes and regions using a single MinION flowcell. The ECNano wet-lab protocol was optimized to perform long-read target enrichment and ONT library preparation so as to consistently generate high-quality MES data with adequate coverage. The subsequent variant-calling workflow, Clair-ensemble, adopted a fast RNN-based variant caller, Clair, and was optimized for target enrichment data. To evaluate its performance and practicality, ECNano was tested on both reference DNA samples and patient samples.
RESULTS: ECNano achieved a deep on-target depth of coverage (DoC), averaging > 100×, and > 98% uniformity using one MinION flowcell. For accurate ONT variant calling, the generated reads sufficiently covered 98.9% of pathogenic positions listed in ClinVar, with 98.96% having at least 30× DoC. ECNano obtained an average read length of 1000 bp. The long reads of ECNano also covered the adjacent splice sites well, with 98.5% of positions having ≥ 30× DoC. Clair-ensemble achieved > 99% recall and accuracy for SNV calling. The whole workflow, from wet-lab protocol to variant detection, was completed within three days.
CONCLUSION: We presented ECNano, an out-of-the-box workflow comprising (1) a wet-lab protocol for ONT target enrichment sequencing and (2) a downstream variant detection workflow, Clair-ensemble. The workflow is cost-effective, with a short turnaround time for high-accuracy variant calling in 4800 clinically significant genes and regions using a single MinION flowcell. The long-read exon-captured data have potential for further development, promoting the application of long-read sequencing in personalized disease treatment and risk prediction.
SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12920-022-01190-3.
format Online
Article
Text
id pubmed-8895767
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-8895767 2022-03-10
ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell
Leung, Amy Wing-Sze; Leung, Henry Chi-Ming; Wong, Chak-Lim; Zheng, Zhen-Xian; Lui, Wui-Wang; Luk, Ho-Ming; Lo, Ivan Fai-Man; Luo, Ruibang; Lam, Tak-Wah
BMC Med Genomics
Technical Advance
BioMed Central 2022-03-04
/pmc/articles/PMC8895767/ /pubmed/35246132 http://dx.doi.org/10.1186/s12920-022-01190-3
Text en
© The Author(s) 2022
Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
title ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell
topic Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8895767/
https://www.ncbi.nlm.nih.gov/pubmed/35246132
http://dx.doi.org/10.1186/s12920-022-01190-3