Cargando…

Multivariate genome-wide association analysis by iterative hard thresholding

MOTIVATION: In a genome-wide association study, analyzing multiple correlated traits simultaneously is potentially superior to analyzing the traits one by one. Standard methods for multivariate genome-wide association study operate marker-by-marker and are computationally intensive. RESULTS: We pres...

Descripción completa

Detalles Bibliográficos
Autores principales: Chu, Benjamin B, Ko, Seyoon, Zhou, Jin J, Jensen, Aubrey, Zhou, Hua, Sinsheimer, Janet S, Lange, Kenneth
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133532/
https://www.ncbi.nlm.nih.gov/pubmed/37067496
http://dx.doi.org/10.1093/bioinformatics/btad193
_version_ 1785031588306747392
author Chu, Benjamin B
Ko, Seyoon
Zhou, Jin J
Jensen, Aubrey
Zhou, Hua
Sinsheimer, Janet S
Lange, Kenneth
author_facet Chu, Benjamin B
Ko, Seyoon
Zhou, Jin J
Jensen, Aubrey
Zhou, Hua
Sinsheimer, Janet S
Lange, Kenneth
author_sort Chu, Benjamin B
collection PubMed
description MOTIVATION: In a genome-wide association study, analyzing multiple correlated traits simultaneously is potentially superior to analyzing the traits one by one. Standard methods for multivariate genome-wide association study operate marker-by-marker and are computationally intensive. RESULTS: We present a sparsity constrained regression algorithm for multivariate genome-wide association study based on iterative hard thresholding and implement it in a convenient Julia package MendelIHT.jl. In simulation studies with up to 100 quantitative traits, iterative hard thresholding exhibits similar true positive rates, smaller false positive rates, and faster execution times than GEMMA’s linear mixed models and mv-PLINK’s canonical correlation analysis. On UK Biobank data with 470 228 variants, MendelIHT completed a three-trait joint analysis ([Formula: see text]) in 20 h and an 18-trait joint analysis ([Formula: see text]) in 53 h with an 80 GB memory footprint. In short, MendelIHT enables geneticists to fit a single regression model that simultaneously considers the effect of all SNPs and dozens of traits. AVAILABILITY AND IMPLEMENTATION: Software, documentation, and scripts to reproduce our results are available from https://github.com/OpenMendel/MendelIHT.jl.
format Online
Article
Text
id pubmed-10133532
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101335322023-04-28 Multivariate genome-wide association analysis by iterative hard thresholding Chu, Benjamin B Ko, Seyoon Zhou, Jin J Jensen, Aubrey Zhou, Hua Sinsheimer, Janet S Lange, Kenneth Bioinformatics Original Paper MOTIVATION: In a genome-wide association study, analyzing multiple correlated traits simultaneously is potentially superior to analyzing the traits one by one. Standard methods for multivariate genome-wide association study operate marker-by-marker and are computationally intensive. RESULTS: We present a sparsity constrained regression algorithm for multivariate genome-wide association study based on iterative hard thresholding and implement it in a convenient Julia package MendelIHT.jl. In simulation studies with up to 100 quantitative traits, iterative hard thresholding exhibits similar true positive rates, smaller false positive rates, and faster execution times than GEMMA’s linear mixed models and mv-PLINK’s canonical correlation analysis. On UK Biobank data with 470 228 variants, MendelIHT completed a three-trait joint analysis ([Formula: see text]) in 20 h and an 18-trait joint analysis ([Formula: see text]) in 53 h with an 80 GB memory footprint. In short, MendelIHT enables geneticists to fit a single regression model that simultaneously considers the effect of all SNPs and dozens of traits. AVAILABILITY AND IMPLEMENTATION: Software, documentation, and scripts to reproduce our results are available from https://github.com/OpenMendel/MendelIHT.jl. Oxford University Press 2023-04-17 /pmc/articles/PMC10133532/ /pubmed/37067496 http://dx.doi.org/10.1093/bioinformatics/btad193 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Chu, Benjamin B
Ko, Seyoon
Zhou, Jin J
Jensen, Aubrey
Zhou, Hua
Sinsheimer, Janet S
Lange, Kenneth
Multivariate genome-wide association analysis by iterative hard thresholding
title Multivariate genome-wide association analysis by iterative hard thresholding
title_full Multivariate genome-wide association analysis by iterative hard thresholding
title_fullStr Multivariate genome-wide association analysis by iterative hard thresholding
title_full_unstemmed Multivariate genome-wide association analysis by iterative hard thresholding
title_short Multivariate genome-wide association analysis by iterative hard thresholding
title_sort multivariate genome-wide association analysis by iterative hard thresholding
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133532/
https://www.ncbi.nlm.nih.gov/pubmed/37067496
http://dx.doi.org/10.1093/bioinformatics/btad193
work_keys_str_mv AT chubenjaminb multivariategenomewideassociationanalysisbyiterativehardthresholding
AT koseyoon multivariategenomewideassociationanalysisbyiterativehardthresholding
AT zhoujinj multivariategenomewideassociationanalysisbyiterativehardthresholding
AT jensenaubrey multivariategenomewideassociationanalysisbyiterativehardthresholding
AT zhouhua multivariategenomewideassociationanalysisbyiterativehardthresholding
AT sinsheimerjanets multivariategenomewideassociationanalysisbyiterativehardthresholding
AT langekenneth multivariategenomewideassociationanalysisbyiterativehardthresholding