Cargando…

ensemblQueryR: fast, flexible and high-throughput querying of Ensembl LD API endpoints in R

We present ensemblQueryR, an R package for querying Ensembl linkage disequilibrium (LD) endpoints. This package is flexible, fast and user-friendly, and optimised for high-throughput querying. ensemblQueryR uses functions that are intuitive and amenable to custom code integration, familiar R object...

Descripción completa

Detalles Bibliográficos
Autores principales: Fairbrother-Browne, Aine, García-Ruiz, Sonia, Hertfelder Reynolds, Regina, Ryten, Mina, Hodgkinson, Alan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: GigaScience Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10507293/
https://www.ncbi.nlm.nih.gov/pubmed/37732134
http://dx.doi.org/10.46471/gigabyte.91
Descripción
Sumario:We present ensemblQueryR, an R package for querying Ensembl linkage disequilibrium (LD) endpoints. This package is flexible, fast and user-friendly, and optimised for high-throughput querying. ensemblQueryR uses functions that are intuitive and amenable to custom code integration, familiar R object types as inputs and outputs as well as providing parallelisation functionality. For each Ensembl LD endpoint, ensemblQueryR provides two functions, permitting both single- and multi-query modes of operation. The multi-query functions are optimised for large query sizes and provide optional parallelisation to leverage available computational resources and minimise processing time. We demonstrate improved computational performance of ensemblQueryR over an exisiting tool in terms of random access memory (RAM) usage and speed, delivering a 10-fold speed increase whilst using a third of the RAM. Finally, ensemblQueryR is near-agnostic to operating system and computational architecture through Docker and singularity images, making this tool widely accessible to the scientific community.