Cargando…
DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing
We consider the correction of errors from nucleotide sequences produced by next-generation targeted amplicon sequencing. The next-generation sequencing (NGS) platforms can provide a great deal of sequencing data thanks to their high throughput, but the associated error rates often tend to be high. D...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5531809/ https://www.ncbi.nlm.nih.gov/pubmed/28749987 http://dx.doi.org/10.1371/journal.pone.0181463 |
_version_ | 1783253400752750592 |
---|---|
author | Lee, Byunghan Moon, Taesup Yoon, Sungroh Weissman, Tsachy |
author_facet | Lee, Byunghan Moon, Taesup Yoon, Sungroh Weissman, Tsachy |
author_sort | Lee, Byunghan |
collection | PubMed |
description | We consider the correction of errors from nucleotide sequences produced by next-generation targeted amplicon sequencing. The next-generation sequencing (NGS) platforms can provide a great deal of sequencing data thanks to their high throughput, but the associated error rates often tend to be high. Denoising in high-throughput sequencing has thus become a crucial process for boosting the reliability of downstream analyses. Our methodology, named DUDE-Seq, is derived from a general setting of reconstructing finite-valued source data corrupted by a discrete memoryless channel and effectively corrects substitution and homopolymer indel errors, the two major types of sequencing errors in most high-throughput targeted amplicon sequencing platforms. Our experimental studies with real and simulated datasets suggest that the proposed DUDE-Seq not only outperforms existing alternatives in terms of error-correction capability and time efficiency, but also boosts the reliability of downstream analyses. Further, the flexibility of DUDE-Seq enables its robust application to different sequencing platforms and analysis pipelines by simple updates of the noise model. DUDE-Seq is available at http://data.snu.ac.kr/pub/dude-seq. |
format | Online Article Text |
id | pubmed-5531809 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-55318092017-08-07 DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing Lee, Byunghan Moon, Taesup Yoon, Sungroh Weissman, Tsachy PLoS One Research Article We consider the correction of errors from nucleotide sequences produced by next-generation targeted amplicon sequencing. The next-generation sequencing (NGS) platforms can provide a great deal of sequencing data thanks to their high throughput, but the associated error rates often tend to be high. Denoising in high-throughput sequencing has thus become a crucial process for boosting the reliability of downstream analyses. Our methodology, named DUDE-Seq, is derived from a general setting of reconstructing finite-valued source data corrupted by a discrete memoryless channel and effectively corrects substitution and homopolymer indel errors, the two major types of sequencing errors in most high-throughput targeted amplicon sequencing platforms. Our experimental studies with real and simulated datasets suggest that the proposed DUDE-Seq not only outperforms existing alternatives in terms of error-correction capability and time efficiency, but also boosts the reliability of downstream analyses. Further, the flexibility of DUDE-Seq enables its robust application to different sequencing platforms and analysis pipelines by simple updates of the noise model. DUDE-Seq is available at http://data.snu.ac.kr/pub/dude-seq. Public Library of Science 2017-07-27 /pmc/articles/PMC5531809/ /pubmed/28749987 http://dx.doi.org/10.1371/journal.pone.0181463 Text en © 2017 Lee et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lee, Byunghan Moon, Taesup Yoon, Sungroh Weissman, Tsachy DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing |
title | DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing |
title_full | DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing |
title_fullStr | DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing |
title_full_unstemmed | DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing |
title_short | DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing |
title_sort | dude-seq: fast, flexible, and robust denoising for targeted amplicon sequencing |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5531809/ https://www.ncbi.nlm.nih.gov/pubmed/28749987 http://dx.doi.org/10.1371/journal.pone.0181463 |
work_keys_str_mv | AT leebyunghan dudeseqfastflexibleandrobustdenoisingfortargetedampliconsequencing AT moontaesup dudeseqfastflexibleandrobustdenoisingfortargetedampliconsequencing AT yoonsungroh dudeseqfastflexibleandrobustdenoisingfortargetedampliconsequencing AT weissmantsachy dudeseqfastflexibleandrobustdenoisingfortargetedampliconsequencing |