Cargando…
Bison: bisulfite alignment on nodes of a cluster
BACKGROUND: DNA methylation changes are associated with a wide array of biological processes. Bisulfite conversion of DNA followed by high-throughput sequencing is increasingly being used to assess genome-wide methylation at single-base resolution. The relative slowness of most commonly used aligner...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4287502/ https://www.ncbi.nlm.nih.gov/pubmed/25326660 http://dx.doi.org/10.1186/1471-2105-15-337 |
_version_ | 1782351798564028416 |
---|---|
author | Ryan, Devon Patrick Ehninger, Dan |
author_facet | Ryan, Devon Patrick Ehninger, Dan |
author_sort | Ryan, Devon Patrick |
collection | PubMed |
description | BACKGROUND: DNA methylation changes are associated with a wide array of biological processes. Bisulfite conversion of DNA followed by high-throughput sequencing is increasingly being used to assess genome-wide methylation at single-base resolution. The relative slowness of most commonly used aligners for processing such data introduces an unnecessarily long delay between receipt of raw data and statistical analysis. While this process can be sped-up by using computer clusters, current tools are not designed with them in mind and end-users must create such implementations themselves. RESULTS: Here, we present a novel BS-seq aligner, Bison, which exploits multiple nodes of a computer cluster to speed up this process and also has increased accuracy. Bison is accompanied by a variety of helper programs and scripts to ease, as much as possible, the process of quality control and preparing results for statistical analysis by a variety of popular R packages. Bison is also accompanied by bison_herd, a variant of Bison with the same output but that can scale to a semi-arbitrary number of nodes, with concomitant increased demands on the underlying message passing interface implementation. CONCLUSIONS: Bison is a new bisulfite-converted short-read aligner providing end users easier scalability for performance gains, more accurate alignments, and a convenient pathway for quality controlling alignments and converting methylation calls into a form appropriate for statistical analysis. Bison and the more scalable bison_herd are natively able to utilize multiple nodes of a computer cluster simultaneously and serve to simplify to the process of creating analysis pipelines. |
format | Online Article Text |
id | pubmed-4287502 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42875022015-01-09 Bison: bisulfite alignment on nodes of a cluster Ryan, Devon Patrick Ehninger, Dan BMC Bioinformatics Software BACKGROUND: DNA methylation changes are associated with a wide array of biological processes. Bisulfite conversion of DNA followed by high-throughput sequencing is increasingly being used to assess genome-wide methylation at single-base resolution. The relative slowness of most commonly used aligners for processing such data introduces an unnecessarily long delay between receipt of raw data and statistical analysis. While this process can be sped-up by using computer clusters, current tools are not designed with them in mind and end-users must create such implementations themselves. RESULTS: Here, we present a novel BS-seq aligner, Bison, which exploits multiple nodes of a computer cluster to speed up this process and also has increased accuracy. Bison is accompanied by a variety of helper programs and scripts to ease, as much as possible, the process of quality control and preparing results for statistical analysis by a variety of popular R packages. Bison is also accompanied by bison_herd, a variant of Bison with the same output but that can scale to a semi-arbitrary number of nodes, with concomitant increased demands on the underlying message passing interface implementation. CONCLUSIONS: Bison is a new bisulfite-converted short-read aligner providing end users easier scalability for performance gains, more accurate alignments, and a convenient pathway for quality controlling alignments and converting methylation calls into a form appropriate for statistical analysis. Bison and the more scalable bison_herd are natively able to utilize multiple nodes of a computer cluster simultaneously and serve to simplify to the process of creating analysis pipelines. BioMed Central 2014-10-18 /pmc/articles/PMC4287502/ /pubmed/25326660 http://dx.doi.org/10.1186/1471-2105-15-337 Text en © Ryan and Ehninger; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Ryan, Devon Patrick Ehninger, Dan Bison: bisulfite alignment on nodes of a cluster |
title | Bison: bisulfite alignment on nodes of a cluster |
title_full | Bison: bisulfite alignment on nodes of a cluster |
title_fullStr | Bison: bisulfite alignment on nodes of a cluster |
title_full_unstemmed | Bison: bisulfite alignment on nodes of a cluster |
title_short | Bison: bisulfite alignment on nodes of a cluster |
title_sort | bison: bisulfite alignment on nodes of a cluster |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4287502/ https://www.ncbi.nlm.nih.gov/pubmed/25326660 http://dx.doi.org/10.1186/1471-2105-15-337 |
work_keys_str_mv | AT ryandevonpatrick bisonbisulfitealignmentonnodesofacluster AT ehningerdan bisonbisulfitealignmentonnodesofacluster |