Cargando…

Bathometer: lightning fast depth-of-reads query

MOTIVATION: The query for the number of reads overlapping a given region is a common step in the analysis of Illumina sequencing data. Sometimes, these queries are not conveniently precomputable. It seems beneficial to make this kind of arbitrary query as fast and convenient as possible. RESULTS: We...

Descripción completa

Detalles Bibliográficos
Autores principales: Stenzel, U, Horn, S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9502157/
https://www.ncbi.nlm.nih.gov/pubmed/33983362
http://dx.doi.org/10.1093/bioinformatics/btab372
Descripción
Sumario:MOTIVATION: The query for the number of reads overlapping a given region is a common step in the analysis of Illumina sequencing data. Sometimes, these queries are not conveniently precomputable. It seems beneficial to make this kind of arbitrary query as fast and convenient as possible. RESULTS: We present Bathometer, a tool that indexes BAM files in a space efficient way, which allows ad hoc queries for the number of reads overlapping any given genomic region to be answered much more quickly than by counting with common tools such as Samtools, while incurring much less disk I/O. AVAILABILITYAND IMPLEMENTATION: Bathometer is implemented in C, licensed under the GNU General Public License version 3+ and freely downloadable from Bitbucket (https://bitbucket.org/ustenzel/bathometer) SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.