Cargando…

Optimizing performance of GATK workflows using Apache Arrow In-Memory data framework

BACKGROUND: Immense improvements in sequencing technologies enable producing large amounts of high throughput and cost effective next-generation sequencing (NGS) data. This data needs to be processed efficiently for further downstream analyses. Computing systems need this large amounts of data close...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmad, Tanveer, Ahmed, Nauman, Al-Ars, Zaid, Hofstee, H. Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7677819/
https://www.ncbi.nlm.nih.gov/pubmed/33208101
http://dx.doi.org/10.1186/s12864-020-07013-y

Ejemplares similares