Cargando…

Bison: bisulfite alignment on nodes of a cluster

BACKGROUND: DNA methylation changes are associated with a wide array of biological processes. Bisulfite conversion of DNA followed by high-throughput sequencing is increasingly being used to assess genome-wide methylation at single-base resolution. The relative slowness of most commonly used aligner...

Descripción completa

Detalles Bibliográficos
Autores principales: Ryan, Devon Patrick, Ehninger, Dan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4287502/
https://www.ncbi.nlm.nih.gov/pubmed/25326660
http://dx.doi.org/10.1186/1471-2105-15-337
_version_ 1782351798564028416
author Ryan, Devon Patrick
Ehninger, Dan
author_facet Ryan, Devon Patrick
Ehninger, Dan
author_sort Ryan, Devon Patrick
collection PubMed
description BACKGROUND: DNA methylation changes are associated with a wide array of biological processes. Bisulfite conversion of DNA followed by high-throughput sequencing is increasingly being used to assess genome-wide methylation at single-base resolution. The relative slowness of most commonly used aligners for processing such data introduces an unnecessarily long delay between receipt of raw data and statistical analysis. While this process can be sped-up by using computer clusters, current tools are not designed with them in mind and end-users must create such implementations themselves. RESULTS: Here, we present a novel BS-seq aligner, Bison, which exploits multiple nodes of a computer cluster to speed up this process and also has increased accuracy. Bison is accompanied by a variety of helper programs and scripts to ease, as much as possible, the process of quality control and preparing results for statistical analysis by a variety of popular R packages. Bison is also accompanied by bison_herd, a variant of Bison with the same output but that can scale to a semi-arbitrary number of nodes, with concomitant increased demands on the underlying message passing interface implementation. CONCLUSIONS: Bison is a new bisulfite-converted short-read aligner providing end users easier scalability for performance gains, more accurate alignments, and a convenient pathway for quality controlling alignments and converting methylation calls into a form appropriate for statistical analysis. Bison and the more scalable bison_herd are natively able to utilize multiple nodes of a computer cluster simultaneously and serve to simplify to the process of creating analysis pipelines.
format Online
Article
Text
id pubmed-4287502
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42875022015-01-09 Bison: bisulfite alignment on nodes of a cluster Ryan, Devon Patrick Ehninger, Dan BMC Bioinformatics Software BACKGROUND: DNA methylation changes are associated with a wide array of biological processes. Bisulfite conversion of DNA followed by high-throughput sequencing is increasingly being used to assess genome-wide methylation at single-base resolution. The relative slowness of most commonly used aligners for processing such data introduces an unnecessarily long delay between receipt of raw data and statistical analysis. While this process can be sped-up by using computer clusters, current tools are not designed with them in mind and end-users must create such implementations themselves. RESULTS: Here, we present a novel BS-seq aligner, Bison, which exploits multiple nodes of a computer cluster to speed up this process and also has increased accuracy. Bison is accompanied by a variety of helper programs and scripts to ease, as much as possible, the process of quality control and preparing results for statistical analysis by a variety of popular R packages. Bison is also accompanied by bison_herd, a variant of Bison with the same output but that can scale to a semi-arbitrary number of nodes, with concomitant increased demands on the underlying message passing interface implementation. CONCLUSIONS: Bison is a new bisulfite-converted short-read aligner providing end users easier scalability for performance gains, more accurate alignments, and a convenient pathway for quality controlling alignments and converting methylation calls into a form appropriate for statistical analysis. Bison and the more scalable bison_herd are natively able to utilize multiple nodes of a computer cluster simultaneously and serve to simplify to the process of creating analysis pipelines. BioMed Central 2014-10-18 /pmc/articles/PMC4287502/ /pubmed/25326660 http://dx.doi.org/10.1186/1471-2105-15-337 Text en © Ryan and Ehninger; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Ryan, Devon Patrick
Ehninger, Dan
Bison: bisulfite alignment on nodes of a cluster
title Bison: bisulfite alignment on nodes of a cluster
title_full Bison: bisulfite alignment on nodes of a cluster
title_fullStr Bison: bisulfite alignment on nodes of a cluster
title_full_unstemmed Bison: bisulfite alignment on nodes of a cluster
title_short Bison: bisulfite alignment on nodes of a cluster
title_sort bison: bisulfite alignment on nodes of a cluster
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4287502/
https://www.ncbi.nlm.nih.gov/pubmed/25326660
http://dx.doi.org/10.1186/1471-2105-15-337
work_keys_str_mv AT ryandevonpatrick bisonbisulfitealignmentonnodesofacluster
AT ehningerdan bisonbisulfitealignmentonnodesofacluster