Cargando…

Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction

The accuracy of Oxford Nanopore Technology (ONT) sequencing has significantly improved thanks to new flowcells, sequencing kits, and basecalling algorithms. However, novel modification types untrained in the basecalling models can seriously reduce the quality. Here we reports a set of ONT-sequenced...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiou, Chien-Shun, Chen, Bo-Han, Wang, You-Wun, Kuo, Nang-Ting, Chang, Chih-Hsiang, Huang, Yao-Ting
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10687267/
https://www.ncbi.nlm.nih.gov/pubmed/38030695
http://dx.doi.org/10.1038/s42003-023-05605-4
_version_ 1785151947343396864
author Chiou, Chien-Shun
Chen, Bo-Han
Wang, You-Wun
Kuo, Nang-Ting
Chang, Chih-Hsiang
Huang, Yao-Ting
author_facet Chiou, Chien-Shun
Chen, Bo-Han
Wang, You-Wun
Kuo, Nang-Ting
Chang, Chih-Hsiang
Huang, Yao-Ting
author_sort Chiou, Chien-Shun
collection PubMed
description The accuracy of Oxford Nanopore Technology (ONT) sequencing has significantly improved thanks to new flowcells, sequencing kits, and basecalling algorithms. However, novel modification types untrained in the basecalling models can seriously reduce the quality. Here we reports a set of ONT-sequenced genomes with unexpected low quality due to novel modification types. Demodification by whole-genome amplification significantly improved the quality but lost the epigenome. We also developed a reference-based method, Modpolish, for correcting modification-mediated errors while retaining the epigenome when a sufficient number of closely-related genomes is publicly available (default: top 20 genomes with at least 95% identity). Modpolish not only significantly improved the quality of in-house sequenced genomes but also public datasets sequenced by R9.4 and R10.4 (simplex). Our results suggested that novel modifications are prone to ONT systematic errors. Nevertheless, these errors are correctable by nucleotide demodification or Modpolish without prior knowledge of modifications.
format Online
Article
Text
id pubmed-10687267
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106872672023-11-30 Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction Chiou, Chien-Shun Chen, Bo-Han Wang, You-Wun Kuo, Nang-Ting Chang, Chih-Hsiang Huang, Yao-Ting Commun Biol Article The accuracy of Oxford Nanopore Technology (ONT) sequencing has significantly improved thanks to new flowcells, sequencing kits, and basecalling algorithms. However, novel modification types untrained in the basecalling models can seriously reduce the quality. Here we reports a set of ONT-sequenced genomes with unexpected low quality due to novel modification types. Demodification by whole-genome amplification significantly improved the quality but lost the epigenome. We also developed a reference-based method, Modpolish, for correcting modification-mediated errors while retaining the epigenome when a sufficient number of closely-related genomes is publicly available (default: top 20 genomes with at least 95% identity). Modpolish not only significantly improved the quality of in-house sequenced genomes but also public datasets sequenced by R9.4 and R10.4 (simplex). Our results suggested that novel modifications are prone to ONT systematic errors. Nevertheless, these errors are correctable by nucleotide demodification or Modpolish without prior knowledge of modifications. Nature Publishing Group UK 2023-11-29 /pmc/articles/PMC10687267/ /pubmed/38030695 http://dx.doi.org/10.1038/s42003-023-05605-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chiou, Chien-Shun
Chen, Bo-Han
Wang, You-Wun
Kuo, Nang-Ting
Chang, Chih-Hsiang
Huang, Yao-Ting
Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction
title Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction
title_full Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction
title_fullStr Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction
title_full_unstemmed Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction
title_short Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction
title_sort correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10687267/
https://www.ncbi.nlm.nih.gov/pubmed/38030695
http://dx.doi.org/10.1038/s42003-023-05605-4
work_keys_str_mv AT chiouchienshun correctingmodificationmediatederrorsinnanoporesequencingbynucleotidedemodificationandreferencebasedcorrection
AT chenbohan correctingmodificationmediatederrorsinnanoporesequencingbynucleotidedemodificationandreferencebasedcorrection
AT wangyouwun correctingmodificationmediatederrorsinnanoporesequencingbynucleotidedemodificationandreferencebasedcorrection
AT kuonangting correctingmodificationmediatederrorsinnanoporesequencingbynucleotidedemodificationandreferencebasedcorrection
AT changchihhsiang correctingmodificationmediatederrorsinnanoporesequencingbynucleotidedemodificationandreferencebasedcorrection
AT huangyaoting correctingmodificationmediatederrorsinnanoporesequencingbynucleotidedemodificationandreferencebasedcorrection