Cargando…
CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing
Merging the forward and reverse reads from paired-end sequencing is a critical task that can significantly improve the performance of downstream tasks, such as genome assembly and mapping, by providing them with virtually elongated reads. However, due to the inherent limitations of most paired-end s...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168710/ https://www.ncbi.nlm.nih.gov/pubmed/25252785 http://dx.doi.org/10.1186/1471-2105-15-S9-S10 |
_version_ | 1782335603911688192 |
---|---|
author | Kwon, Sunyoung Lee, Byunghan Yoon, Sungroh |
author_facet | Kwon, Sunyoung Lee, Byunghan Yoon, Sungroh |
author_sort | Kwon, Sunyoung |
collection | PubMed |
description | Merging the forward and reverse reads from paired-end sequencing is a critical task that can significantly improve the performance of downstream tasks, such as genome assembly and mapping, by providing them with virtually elongated reads. However, due to the inherent limitations of most paired-end sequencers, the chance of observing erroneous bases grows rapidly as the end of a read is approached, which becomes a critical hurdle for accurately merging paired-end reads. Although there exist several sophisticated approaches to this problem, their performance in terms of quality of merging often remains unsatisfactory. To address this issue, here we present a context-aware scheme for paired-end reads (CASPER): a computational method to rapidly and robustly merge overlapping paired-end reads. Being particularly well suited to amplicon sequencing applications, CASPER is thoroughly tested with both simulated and real high-throughput amplicon sequencing data. According to our experimental results, CASPER significantly outperforms existing state-of-the art paired-end merging tools in terms of accuracy and robustness. CASPER also exploits the parallelism in the task of paired-end merging and effectively speeds up by multithreading. CASPER is freely available for academic use at http://best.snu.ac.kr/casper. |
format | Online Article Text |
id | pubmed-4168710 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41687102014-10-02 CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing Kwon, Sunyoung Lee, Byunghan Yoon, Sungroh BMC Bioinformatics Proceedings Merging the forward and reverse reads from paired-end sequencing is a critical task that can significantly improve the performance of downstream tasks, such as genome assembly and mapping, by providing them with virtually elongated reads. However, due to the inherent limitations of most paired-end sequencers, the chance of observing erroneous bases grows rapidly as the end of a read is approached, which becomes a critical hurdle for accurately merging paired-end reads. Although there exist several sophisticated approaches to this problem, their performance in terms of quality of merging often remains unsatisfactory. To address this issue, here we present a context-aware scheme for paired-end reads (CASPER): a computational method to rapidly and robustly merge overlapping paired-end reads. Being particularly well suited to amplicon sequencing applications, CASPER is thoroughly tested with both simulated and real high-throughput amplicon sequencing data. According to our experimental results, CASPER significantly outperforms existing state-of-the art paired-end merging tools in terms of accuracy and robustness. CASPER also exploits the parallelism in the task of paired-end merging and effectively speeds up by multithreading. CASPER is freely available for academic use at http://best.snu.ac.kr/casper. BioMed Central 2014-09-10 /pmc/articles/PMC4168710/ /pubmed/25252785 http://dx.doi.org/10.1186/1471-2105-15-S9-S10 Text en Copyright © 2014 Kwon et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Kwon, Sunyoung Lee, Byunghan Yoon, Sungroh CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing |
title | CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing |
title_full | CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing |
title_fullStr | CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing |
title_full_unstemmed | CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing |
title_short | CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing |
title_sort | casper: context-aware scheme for paired-end reads from high-throughput amplicon sequencing |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168710/ https://www.ncbi.nlm.nih.gov/pubmed/25252785 http://dx.doi.org/10.1186/1471-2105-15-S9-S10 |
work_keys_str_mv | AT kwonsunyoung caspercontextawareschemeforpairedendreadsfromhighthroughputampliconsequencing AT leebyunghan caspercontextawareschemeforpairedendreadsfromhighthroughputampliconsequencing AT yoonsungroh caspercontextawareschemeforpairedendreadsfromhighthroughputampliconsequencing |