Cargando…

DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing

We consider the correction of errors from nucleotide sequences produced by next-generation targeted amplicon sequencing. The next-generation sequencing (NGS) platforms can provide a great deal of sequencing data thanks to their high throughput, but the associated error rates often tend to be high. D...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Byunghan, Moon, Taesup, Yoon, Sungroh, Weissman, Tsachy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5531809/
https://www.ncbi.nlm.nih.gov/pubmed/28749987
http://dx.doi.org/10.1371/journal.pone.0181463
_version_ 1783253400752750592
author Lee, Byunghan
Moon, Taesup
Yoon, Sungroh
Weissman, Tsachy
author_facet Lee, Byunghan
Moon, Taesup
Yoon, Sungroh
Weissman, Tsachy
author_sort Lee, Byunghan
collection PubMed
description We consider the correction of errors from nucleotide sequences produced by next-generation targeted amplicon sequencing. The next-generation sequencing (NGS) platforms can provide a great deal of sequencing data thanks to their high throughput, but the associated error rates often tend to be high. Denoising in high-throughput sequencing has thus become a crucial process for boosting the reliability of downstream analyses. Our methodology, named DUDE-Seq, is derived from a general setting of reconstructing finite-valued source data corrupted by a discrete memoryless channel and effectively corrects substitution and homopolymer indel errors, the two major types of sequencing errors in most high-throughput targeted amplicon sequencing platforms. Our experimental studies with real and simulated datasets suggest that the proposed DUDE-Seq not only outperforms existing alternatives in terms of error-correction capability and time efficiency, but also boosts the reliability of downstream analyses. Further, the flexibility of DUDE-Seq enables its robust application to different sequencing platforms and analysis pipelines by simple updates of the noise model. DUDE-Seq is available at http://data.snu.ac.kr/pub/dude-seq.
format Online
Article
Text
id pubmed-5531809
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55318092017-08-07 DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing Lee, Byunghan Moon, Taesup Yoon, Sungroh Weissman, Tsachy PLoS One Research Article We consider the correction of errors from nucleotide sequences produced by next-generation targeted amplicon sequencing. The next-generation sequencing (NGS) platforms can provide a great deal of sequencing data thanks to their high throughput, but the associated error rates often tend to be high. Denoising in high-throughput sequencing has thus become a crucial process for boosting the reliability of downstream analyses. Our methodology, named DUDE-Seq, is derived from a general setting of reconstructing finite-valued source data corrupted by a discrete memoryless channel and effectively corrects substitution and homopolymer indel errors, the two major types of sequencing errors in most high-throughput targeted amplicon sequencing platforms. Our experimental studies with real and simulated datasets suggest that the proposed DUDE-Seq not only outperforms existing alternatives in terms of error-correction capability and time efficiency, but also boosts the reliability of downstream analyses. Further, the flexibility of DUDE-Seq enables its robust application to different sequencing platforms and analysis pipelines by simple updates of the noise model. DUDE-Seq is available at http://data.snu.ac.kr/pub/dude-seq. Public Library of Science 2017-07-27 /pmc/articles/PMC5531809/ /pubmed/28749987 http://dx.doi.org/10.1371/journal.pone.0181463 Text en © 2017 Lee et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lee, Byunghan
Moon, Taesup
Yoon, Sungroh
Weissman, Tsachy
DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing
title DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing
title_full DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing
title_fullStr DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing
title_full_unstemmed DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing
title_short DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing
title_sort dude-seq: fast, flexible, and robust denoising for targeted amplicon sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5531809/
https://www.ncbi.nlm.nih.gov/pubmed/28749987
http://dx.doi.org/10.1371/journal.pone.0181463
work_keys_str_mv AT leebyunghan dudeseqfastflexibleandrobustdenoisingfortargetedampliconsequencing
AT moontaesup dudeseqfastflexibleandrobustdenoisingfortargetedampliconsequencing
AT yoonsungroh dudeseqfastflexibleandrobustdenoisingfortargetedampliconsequencing
AT weissmantsachy dudeseqfastflexibleandrobustdenoisingfortargetedampliconsequencing