Cargando…

GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS

Genome-wide association study (GWAS) is a powerful approach that has revolutionized the field of quantitative genetics. Two-dimensional GWAS that accounts for epistatic genetic effects needs to consider the effects of marker pairs, thus quadratic genetic variants, compared to one-dimensional GWAS th...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wenchao, Dai, Xinbin, Xu, Shizhong, Zhao, Patrick X
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671369/
https://www.ncbi.nlm.nih.gov/pubmed/33575561
http://dx.doi.org/10.1093/nargab/lqz009
_version_ 1783610916265263104
author Zhang, Wenchao
Dai, Xinbin
Xu, Shizhong
Zhao, Patrick X
author_facet Zhang, Wenchao
Dai, Xinbin
Xu, Shizhong
Zhao, Patrick X
author_sort Zhang, Wenchao
collection PubMed
description Genome-wide association study (GWAS) is a powerful approach that has revolutionized the field of quantitative genetics. Two-dimensional GWAS that accounts for epistatic genetic effects needs to consider the effects of marker pairs, thus quadratic genetic variants, compared to one-dimensional GWAS that accounts for individual genetic variants. Calculating genome-wide kinship matrices in GWAS that account for relationships among individuals represented by ultra-high dimensional genetic variants is computationally challenging. Fortunately, kinship matrix calculation involves pure matrix operations and the algorithms can be parallelized, particular on graphics processing unit (GPU)-empowered high-performance computing (HPC) architectures. We have devised a new method and two pipelines: KMC1D and KMC2D for kinship matrix calculation with high-dimensional genetic variants, respectively, facilitating 1D and 2D GWAS analyses. We first divide the ultra-high-dimensional markers and marker pairs into successive blocks. We then calculate the kinship matrix for each block and merge together the block-wise kinship matrices to form the genome-wide kinship matrix. All the matrix operations have been parallelized using GPU kernels on our NVIDIA GPU-accelerated server platform. The performance analyses show that the calculation speed of KMC1D and KMC2D can be accelerated by 100–400 times over the conventional CPU-based computing.
format Online
Article
Text
id pubmed-7671369
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-76713692021-02-10 GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS Zhang, Wenchao Dai, Xinbin Xu, Shizhong Zhao, Patrick X NAR Genom Bioinform Methods Article Genome-wide association study (GWAS) is a powerful approach that has revolutionized the field of quantitative genetics. Two-dimensional GWAS that accounts for epistatic genetic effects needs to consider the effects of marker pairs, thus quadratic genetic variants, compared to one-dimensional GWAS that accounts for individual genetic variants. Calculating genome-wide kinship matrices in GWAS that account for relationships among individuals represented by ultra-high dimensional genetic variants is computationally challenging. Fortunately, kinship matrix calculation involves pure matrix operations and the algorithms can be parallelized, particular on graphics processing unit (GPU)-empowered high-performance computing (HPC) architectures. We have devised a new method and two pipelines: KMC1D and KMC2D for kinship matrix calculation with high-dimensional genetic variants, respectively, facilitating 1D and 2D GWAS analyses. We first divide the ultra-high-dimensional markers and marker pairs into successive blocks. We then calculate the kinship matrix for each block and merge together the block-wise kinship matrices to form the genome-wide kinship matrix. All the matrix operations have been parallelized using GPU kernels on our NVIDIA GPU-accelerated server platform. The performance analyses show that the calculation speed of KMC1D and KMC2D can be accelerated by 100–400 times over the conventional CPU-based computing. Oxford University Press 2019-10-03 /pmc/articles/PMC7671369/ /pubmed/33575561 http://dx.doi.org/10.1093/nargab/lqz009 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Article
Zhang, Wenchao
Dai, Xinbin
Xu, Shizhong
Zhao, Patrick X
GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS
title GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS
title_full GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS
title_fullStr GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS
title_full_unstemmed GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS
title_short GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS
title_sort gpu empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1d and 2d gwas
topic Methods Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671369/
https://www.ncbi.nlm.nih.gov/pubmed/33575561
http://dx.doi.org/10.1093/nargab/lqz009
work_keys_str_mv AT zhangwenchao gpuempoweredpipelinesforcalculatinggenomewidekinshipmatriceswithultrahighdimensionalgeneticvariantsandfacilitating1dand2dgwas
AT daixinbin gpuempoweredpipelinesforcalculatinggenomewidekinshipmatriceswithultrahighdimensionalgeneticvariantsandfacilitating1dand2dgwas
AT xushizhong gpuempoweredpipelinesforcalculatinggenomewidekinshipmatriceswithultrahighdimensionalgeneticvariantsandfacilitating1dand2dgwas
AT zhaopatrickx gpuempoweredpipelinesforcalculatinggenomewidekinshipmatriceswithultrahighdimensionalgeneticvariantsandfacilitating1dand2dgwas