Cargando…

PEAR: a fast and accurate Illumina Paired-End reAd mergeR

Motivation: The Illumina paired-end sequencing technology can generate reads from both ends of target DNA fragments, which can subsequently be merged to increase the overall read length. There already exist tools for merging these paired-end reads when the target fragments are equally long. However,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Jiajie, Kobert, Kassian, Flouri, Tomáš, Stamatakis, Alexandros
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2014
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3933873/ https://www.ncbi.nlm.nih.gov/pubmed/24142950 http://dx.doi.org/10.1093/bioinformatics/btt593

_version_	1782305001869148160
author	Zhang, Jiajie Kobert, Kassian Flouri, Tomáš Stamatakis, Alexandros
author_facet	Zhang, Jiajie Kobert, Kassian Flouri, Tomáš Stamatakis, Alexandros
author_sort	Zhang, Jiajie
collection	PubMed
description	Motivation: The Illumina paired-end sequencing technology can generate reads from both ends of target DNA fragments, which can subsequently be merged to increase the overall read length. There already exist tools for merging these paired-end reads when the target fragments are equally long. However, when fragment lengths vary and, in particular, when either the fragment size is shorter than a single-end read, or longer than twice the size of a single-end read, most state-of-the-art mergers fail to generate reliable results. Therefore, a robust tool is needed to merge paired-end reads that exhibit varying overlap lengths because of varying target fragment lengths. Results: We present the PEAR software for merging raw Illumina paired-end reads from target fragments of varying length. The program evaluates all possible paired-end read overlaps and does not require the target fragment size as input. It also implements a statistical test for minimizing false-positive results. Tests on simulated and empirical data show that PEAR consistently generates highly accurate merged paired-end reads. A highly optimized implementation allows for merging millions of paired-end reads within a few minutes on a standard desktop computer. On multi-core architectures, the parallel version of PEAR shows linear speedups compared with the sequential version of PEAR. Availability and implementation: PEAR is implemented in C and uses POSIX threads. It is freely available at http://www.exelixis-lab.org/web/software/pear. Contact: Tomas.Flouri@h-its.org
format	Online Article Text
id	pubmed-3933873
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-39338732014-03-12 PEAR: a fast and accurate Illumina Paired-End reAd mergeR Zhang, Jiajie Kobert, Kassian Flouri, Tomáš Stamatakis, Alexandros Bioinformatics Original Papers Motivation: The Illumina paired-end sequencing technology can generate reads from both ends of target DNA fragments, which can subsequently be merged to increase the overall read length. There already exist tools for merging these paired-end reads when the target fragments are equally long. However, when fragment lengths vary and, in particular, when either the fragment size is shorter than a single-end read, or longer than twice the size of a single-end read, most state-of-the-art mergers fail to generate reliable results. Therefore, a robust tool is needed to merge paired-end reads that exhibit varying overlap lengths because of varying target fragment lengths. Results: We present the PEAR software for merging raw Illumina paired-end reads from target fragments of varying length. The program evaluates all possible paired-end read overlaps and does not require the target fragment size as input. It also implements a statistical test for minimizing false-positive results. Tests on simulated and empirical data show that PEAR consistently generates highly accurate merged paired-end reads. A highly optimized implementation allows for merging millions of paired-end reads within a few minutes on a standard desktop computer. On multi-core architectures, the parallel version of PEAR shows linear speedups compared with the sequential version of PEAR. Availability and implementation: PEAR is implemented in C and uses POSIX threads. It is freely available at http://www.exelixis-lab.org/web/software/pear. Contact: Tomas.Flouri@h-its.org Oxford University Press 2014-03-01 2013-10-18 /pmc/articles/PMC3933873/ /pubmed/24142950 http://dx.doi.org/10.1093/bioinformatics/btt593 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Papers Zhang, Jiajie Kobert, Kassian Flouri, Tomáš Stamatakis, Alexandros PEAR: a fast and accurate Illumina Paired-End reAd mergeR
title	PEAR: a fast and accurate Illumina Paired-End reAd mergeR
title_full	PEAR: a fast and accurate Illumina Paired-End reAd mergeR
title_fullStr	PEAR: a fast and accurate Illumina Paired-End reAd mergeR
title_full_unstemmed	PEAR: a fast and accurate Illumina Paired-End reAd mergeR
title_short	PEAR: a fast and accurate Illumina Paired-End reAd mergeR
title_sort	pear: a fast and accurate illumina paired-end read merger
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3933873/ https://www.ncbi.nlm.nih.gov/pubmed/24142950 http://dx.doi.org/10.1093/bioinformatics/btt593
work_keys_str_mv	AT zhangjiajie pearafastandaccurateilluminapairedendreadmerger AT kobertkassian pearafastandaccurateilluminapairedendreadmerger AT flouritomas pearafastandaccurateilluminapairedendreadmerger AT stamatakisalexandros pearafastandaccurateilluminapairedendreadmerger

PEAR: a fast and accurate Illumina Paired-End reAd mergeR

Ejemplares similares