Cargando…

CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing

Merging the forward and reverse reads from paired-end sequencing is a critical task that can significantly improve the performance of downstream tasks, such as genome assembly and mapping, by providing them with virtually elongated reads. However, due to the inherent limitations of most paired-end s...

Descripción completa

Detalles Bibliográficos
Autores principales: Kwon, Sunyoung, Lee, Byunghan, Yoon, Sungroh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168710/
https://www.ncbi.nlm.nih.gov/pubmed/25252785
http://dx.doi.org/10.1186/1471-2105-15-S9-S10
_version_ 1782335603911688192
author Kwon, Sunyoung
Lee, Byunghan
Yoon, Sungroh
author_facet Kwon, Sunyoung
Lee, Byunghan
Yoon, Sungroh
author_sort Kwon, Sunyoung
collection PubMed
description Merging the forward and reverse reads from paired-end sequencing is a critical task that can significantly improve the performance of downstream tasks, such as genome assembly and mapping, by providing them with virtually elongated reads. However, due to the inherent limitations of most paired-end sequencers, the chance of observing erroneous bases grows rapidly as the end of a read is approached, which becomes a critical hurdle for accurately merging paired-end reads. Although there exist several sophisticated approaches to this problem, their performance in terms of quality of merging often remains unsatisfactory. To address this issue, here we present a context-aware scheme for paired-end reads (CASPER): a computational method to rapidly and robustly merge overlapping paired-end reads. Being particularly well suited to amplicon sequencing applications, CASPER is thoroughly tested with both simulated and real high-throughput amplicon sequencing data. According to our experimental results, CASPER significantly outperforms existing state-of-the art paired-end merging tools in terms of accuracy and robustness. CASPER also exploits the parallelism in the task of paired-end merging and effectively speeds up by multithreading. CASPER is freely available for academic use at http://best.snu.ac.kr/casper.
format Online
Article
Text
id pubmed-4168710
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41687102014-10-02 CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing Kwon, Sunyoung Lee, Byunghan Yoon, Sungroh BMC Bioinformatics Proceedings Merging the forward and reverse reads from paired-end sequencing is a critical task that can significantly improve the performance of downstream tasks, such as genome assembly and mapping, by providing them with virtually elongated reads. However, due to the inherent limitations of most paired-end sequencers, the chance of observing erroneous bases grows rapidly as the end of a read is approached, which becomes a critical hurdle for accurately merging paired-end reads. Although there exist several sophisticated approaches to this problem, their performance in terms of quality of merging often remains unsatisfactory. To address this issue, here we present a context-aware scheme for paired-end reads (CASPER): a computational method to rapidly and robustly merge overlapping paired-end reads. Being particularly well suited to amplicon sequencing applications, CASPER is thoroughly tested with both simulated and real high-throughput amplicon sequencing data. According to our experimental results, CASPER significantly outperforms existing state-of-the art paired-end merging tools in terms of accuracy and robustness. CASPER also exploits the parallelism in the task of paired-end merging and effectively speeds up by multithreading. CASPER is freely available for academic use at http://best.snu.ac.kr/casper. BioMed Central 2014-09-10 /pmc/articles/PMC4168710/ /pubmed/25252785 http://dx.doi.org/10.1186/1471-2105-15-S9-S10 Text en Copyright © 2014 Kwon et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Kwon, Sunyoung
Lee, Byunghan
Yoon, Sungroh
CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing
title CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing
title_full CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing
title_fullStr CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing
title_full_unstemmed CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing
title_short CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing
title_sort casper: context-aware scheme for paired-end reads from high-throughput amplicon sequencing
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168710/
https://www.ncbi.nlm.nih.gov/pubmed/25252785
http://dx.doi.org/10.1186/1471-2105-15-S9-S10
work_keys_str_mv AT kwonsunyoung caspercontextawareschemeforpairedendreadsfromhighthroughputampliconsequencing
AT leebyunghan caspercontextawareschemeforpairedendreadsfromhighthroughputampliconsequencing
AT yoonsungroh caspercontextawareschemeforpairedendreadsfromhighthroughputampliconsequencing