Cargando…

Efficient querying of genomic reference databases with gget

MOTIVATION: A recurring challenge in interpreting genomic data is the assessment of results in the context of existing reference databases. With the increasing number of command line and Python users, there is a need for tools implementing automated, easy programmatic access to curated reference inf...

Descripción completa

Detalles Bibliográficos
Autores principales: Luebbert, Laura, Pachter, Lior
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9835474/
https://www.ncbi.nlm.nih.gov/pubmed/36610989
http://dx.doi.org/10.1093/bioinformatics/btac836
Descripción
Sumario:MOTIVATION: A recurring challenge in interpreting genomic data is the assessment of results in the context of existing reference databases. With the increasing number of command line and Python users, there is a need for tools implementing automated, easy programmatic access to curated reference information stored in a diverse collection of large, public genomic databases. RESULTS: gget is a free and open-source command line tool and Python package that enables efficient querying of genomic reference databases, such as Ensembl. gget consists of a collection of separate but interoperable modules, each designed to facilitate one type of database querying required for genomic data analysis in a single line of code. AVAILABILITY AND IMPLEMENTATION: The manual and source code are available at https://github.com/pachterlab/gget. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.