Cargando…
Optimizing performance of GATK workflows using Apache Arrow In-Memory data framework
BACKGROUND: Immense improvements in sequencing technologies enable producing large amounts of high throughput and cost effective next-generation sequencing (NGS) data. This data needs to be processed efficiently for further downstream analyses. Computing systems need this large amounts of data close...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7677819/ https://www.ncbi.nlm.nih.gov/pubmed/33208101 http://dx.doi.org/10.1186/s12864-020-07013-y |