Cargando…

Efficient Acceleration of the Pair-HMMs Forward Algorithm for GATK HaplotypeCaller on Graphics Processing Units

GATK HaplotypeCaller (HC) is a popular variant caller, which is widely used to identify variants in complex genomes. However, due to its high variants detection accuracy, it suffers from long execution time. In GATK HC, the pair-HMMs forward algorithm accounts for a large percentage of the total exe...

Descripción completa

Detalles Bibliográficos
Autores principales: Ren, Shanshan, Bertels, Koen, Al-Ars, Zaid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5858735/
https://www.ncbi.nlm.nih.gov/pubmed/29568218
http://dx.doi.org/10.1177/1176934318760543
_version_ 1783307703715627008
author Ren, Shanshan
Bertels, Koen
Al-Ars, Zaid
author_facet Ren, Shanshan
Bertels, Koen
Al-Ars, Zaid
author_sort Ren, Shanshan
collection PubMed
description GATK HaplotypeCaller (HC) is a popular variant caller, which is widely used to identify variants in complex genomes. However, due to its high variants detection accuracy, it suffers from long execution time. In GATK HC, the pair-HMMs forward algorithm accounts for a large percentage of the total execution time. This article proposes to accelerate the pair-HMMs forward algorithm on graphics processing units (GPUs) to improve the performance of GATK HC. This article presents several GPU-based implementations of the pair-HMMs forward algorithm. It also analyzes the performance bottlenecks of the implementations on an NVIDIA Tesla K40 card with various data sets. Based on these results and the characteristics of GATK HC, we are able to identify the GPU-based implementations with the highest performance for the various analyzed data sets. Experimental results show that the GPU-based implementations of the pair-HMMs forward algorithm achieve a speedup of up to 5.47× over existing GPU-based implementations.
format Online
Article
Text
id pubmed-5858735
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-58587352018-03-22 Efficient Acceleration of the Pair-HMMs Forward Algorithm for GATK HaplotypeCaller on Graphics Processing Units Ren, Shanshan Bertels, Koen Al-Ars, Zaid Evol Bioinform Online Review: Special Collection: Computational Bioinformatics Tools for Evolutionary Genomics GATK HaplotypeCaller (HC) is a popular variant caller, which is widely used to identify variants in complex genomes. However, due to its high variants detection accuracy, it suffers from long execution time. In GATK HC, the pair-HMMs forward algorithm accounts for a large percentage of the total execution time. This article proposes to accelerate the pair-HMMs forward algorithm on graphics processing units (GPUs) to improve the performance of GATK HC. This article presents several GPU-based implementations of the pair-HMMs forward algorithm. It also analyzes the performance bottlenecks of the implementations on an NVIDIA Tesla K40 card with various data sets. Based on these results and the characteristics of GATK HC, we are able to identify the GPU-based implementations with the highest performance for the various analyzed data sets. Experimental results show that the GPU-based implementations of the pair-HMMs forward algorithm achieve a speedup of up to 5.47× over existing GPU-based implementations. SAGE Publications 2018-03-12 /pmc/articles/PMC5858735/ /pubmed/29568218 http://dx.doi.org/10.1177/1176934318760543 Text en © The Author(s) 2018 http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Review: Special Collection: Computational Bioinformatics Tools for Evolutionary Genomics
Ren, Shanshan
Bertels, Koen
Al-Ars, Zaid
Efficient Acceleration of the Pair-HMMs Forward Algorithm for GATK HaplotypeCaller on Graphics Processing Units
title Efficient Acceleration of the Pair-HMMs Forward Algorithm for GATK HaplotypeCaller on Graphics Processing Units
title_full Efficient Acceleration of the Pair-HMMs Forward Algorithm for GATK HaplotypeCaller on Graphics Processing Units
title_fullStr Efficient Acceleration of the Pair-HMMs Forward Algorithm for GATK HaplotypeCaller on Graphics Processing Units
title_full_unstemmed Efficient Acceleration of the Pair-HMMs Forward Algorithm for GATK HaplotypeCaller on Graphics Processing Units
title_short Efficient Acceleration of the Pair-HMMs Forward Algorithm for GATK HaplotypeCaller on Graphics Processing Units
title_sort efficient acceleration of the pair-hmms forward algorithm for gatk haplotypecaller on graphics processing units
topic Review: Special Collection: Computational Bioinformatics Tools for Evolutionary Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5858735/
https://www.ncbi.nlm.nih.gov/pubmed/29568218
http://dx.doi.org/10.1177/1176934318760543
work_keys_str_mv AT renshanshan efficientaccelerationofthepairhmmsforwardalgorithmforgatkhaplotypecallerongraphicsprocessingunits
AT bertelskoen efficientaccelerationofthepairhmmsforwardalgorithmforgatkhaplotypecallerongraphicsprocessingunits
AT alarszaid efficientaccelerationofthepairhmmsforwardalgorithmforgatkhaplotypecallerongraphicsprocessingunits