Cargando…
Scaling read aligners to hundreds of threads on general-purpose processors
MOTIVATION: General-purpose processors can now contain many dozens of processor cores and support hundreds of simultaneous threads of execution. To make best use of these threads, genomics software must contend with new and subtle computer architecture issues. We discuss some of these and propose me...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6361242/ https://www.ncbi.nlm.nih.gov/pubmed/30020410 http://dx.doi.org/10.1093/bioinformatics/bty648 |
_version_ | 1783392655542059008 |
---|---|
author | Langmead, Ben Wilks, Christopher Antonescu, Valentin Charles, Rone |
author_facet | Langmead, Ben Wilks, Christopher Antonescu, Valentin Charles, Rone |
author_sort | Langmead, Ben |
collection | PubMed |
description | MOTIVATION: General-purpose processors can now contain many dozens of processor cores and support hundreds of simultaneous threads of execution. To make best use of these threads, genomics software must contend with new and subtle computer architecture issues. We discuss some of these and propose methods for improving thread scaling in tools that analyze each read independently, such as read aligners. RESULTS: We implement these methods in new versions of Bowtie, Bowtie 2 and HISAT. We greatly improve thread scaling in many scenarios, including on the recent Intel Xeon Phi architecture. We also highlight how bottlenecks are exacerbated by variable-record-length file formats like FASTQ and suggest changes that enable superior scaling. AVAILABILITY AND IMPLEMENTATION: Experiments for this study: https://github.com/BenLangmead/bowtie-scaling. BOWTIE: http://bowtie-bio.sourceforge.net. BOWTIE 2: http://bowtie-bio.sourceforge.net/bowtie2. HISAT: http://www.ccb.jhu.edu/software/hisat SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6361242 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-63612422019-02-08 Scaling read aligners to hundreds of threads on general-purpose processors Langmead, Ben Wilks, Christopher Antonescu, Valentin Charles, Rone Bioinformatics Original Papers MOTIVATION: General-purpose processors can now contain many dozens of processor cores and support hundreds of simultaneous threads of execution. To make best use of these threads, genomics software must contend with new and subtle computer architecture issues. We discuss some of these and propose methods for improving thread scaling in tools that analyze each read independently, such as read aligners. RESULTS: We implement these methods in new versions of Bowtie, Bowtie 2 and HISAT. We greatly improve thread scaling in many scenarios, including on the recent Intel Xeon Phi architecture. We also highlight how bottlenecks are exacerbated by variable-record-length file formats like FASTQ and suggest changes that enable superior scaling. AVAILABILITY AND IMPLEMENTATION: Experiments for this study: https://github.com/BenLangmead/bowtie-scaling. BOWTIE: http://bowtie-bio.sourceforge.net. BOWTIE 2: http://bowtie-bio.sourceforge.net/bowtie2. HISAT: http://www.ccb.jhu.edu/software/hisat SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-02-01 2018-07-18 /pmc/articles/PMC6361242/ /pubmed/30020410 http://dx.doi.org/10.1093/bioinformatics/bty648 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Langmead, Ben Wilks, Christopher Antonescu, Valentin Charles, Rone Scaling read aligners to hundreds of threads on general-purpose processors |
title | Scaling read aligners to hundreds of threads on general-purpose processors |
title_full | Scaling read aligners to hundreds of threads on general-purpose processors |
title_fullStr | Scaling read aligners to hundreds of threads on general-purpose processors |
title_full_unstemmed | Scaling read aligners to hundreds of threads on general-purpose processors |
title_short | Scaling read aligners to hundreds of threads on general-purpose processors |
title_sort | scaling read aligners to hundreds of threads on general-purpose processors |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6361242/ https://www.ncbi.nlm.nih.gov/pubmed/30020410 http://dx.doi.org/10.1093/bioinformatics/bty648 |
work_keys_str_mv | AT langmeadben scalingreadalignerstohundredsofthreadsongeneralpurposeprocessors AT wilkschristopher scalingreadalignerstohundredsofthreadsongeneralpurposeprocessors AT antonescuvalentin scalingreadalignerstohundredsofthreadsongeneralpurposeprocessors AT charlesrone scalingreadalignerstohundredsofthreadsongeneralpurposeprocessors |