Cargando…
BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters
Large datasets can be screened for sequences from a specific organism, quickly and with low memory requirements, by a data structure that supports time- and memory-efficient set membership queries. Bloom filters offer such queries but require that false positives be controlled. We present BioBloom T...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4816029/ https://www.ncbi.nlm.nih.gov/pubmed/25143290 http://dx.doi.org/10.1093/bioinformatics/btu558 |
_version_ | 1782424654782136320 |
---|---|
author | Chu, Justin Sadeghi, Sara Raymond, Anthony Jackman, Shaun D. Nip, Ka Ming Mar, Richard Mohamadi, Hamid Butterfield, Yaron S. Robertson, A. Gordon Birol, Inanç |
author_facet | Chu, Justin Sadeghi, Sara Raymond, Anthony Jackman, Shaun D. Nip, Ka Ming Mar, Richard Mohamadi, Hamid Butterfield, Yaron S. Robertson, A. Gordon Birol, Inanç |
author_sort | Chu, Justin |
collection | PubMed |
description | Large datasets can be screened for sequences from a specific organism, quickly and with low memory requirements, by a data structure that supports time- and memory-efficient set membership queries. Bloom filters offer such queries but require that false positives be controlled. We present BioBloom Tools, a Bloom filter-based sequence-screening tool that is faster than BWA, Bowtie 2 (popular alignment algorithms) and FACS (a membership query algorithm). It delivers accuracies comparable with these tools, controls false positives and has low memory requirements. Availability and implementaion: www.bcgsc.ca/platform/bioinfo/software/biobloomtools Contact: cjustin@bcgsc.ca or ibirol@bcgsc.ca Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4816029 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-48160292016-04-04 BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters Chu, Justin Sadeghi, Sara Raymond, Anthony Jackman, Shaun D. Nip, Ka Ming Mar, Richard Mohamadi, Hamid Butterfield, Yaron S. Robertson, A. Gordon Birol, Inanç Bioinformatics Applications Notes Large datasets can be screened for sequences from a specific organism, quickly and with low memory requirements, by a data structure that supports time- and memory-efficient set membership queries. Bloom filters offer such queries but require that false positives be controlled. We present BioBloom Tools, a Bloom filter-based sequence-screening tool that is faster than BWA, Bowtie 2 (popular alignment algorithms) and FACS (a membership query algorithm). It delivers accuracies comparable with these tools, controls false positives and has low memory requirements. Availability and implementaion: www.bcgsc.ca/platform/bioinfo/software/biobloomtools Contact: cjustin@bcgsc.ca or ibirol@bcgsc.ca Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-12-01 2014-08-20 /pmc/articles/PMC4816029/ /pubmed/25143290 http://dx.doi.org/10.1093/bioinformatics/btu558 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Applications Notes Chu, Justin Sadeghi, Sara Raymond, Anthony Jackman, Shaun D. Nip, Ka Ming Mar, Richard Mohamadi, Hamid Butterfield, Yaron S. Robertson, A. Gordon Birol, Inanç BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters |
title | BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters |
title_full | BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters |
title_fullStr | BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters |
title_full_unstemmed | BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters |
title_short | BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters |
title_sort | biobloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4816029/ https://www.ncbi.nlm.nih.gov/pubmed/25143290 http://dx.doi.org/10.1093/bioinformatics/btu558 |
work_keys_str_mv | AT chujustin biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters AT sadeghisara biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters AT raymondanthony biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters AT jackmanshaund biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters AT nipkaming biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters AT marrichard biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters AT mohamadihamid biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters AT butterfieldyarons biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters AT robertsonagordon biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters AT birolinanc biobloomtoolsfastaccurateandmemoryefficienthostspeciessequencescreeningusingbloomfilters |