Cargando…

Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships

BACKGROUND: Heritability and genetic correlation can be estimated from genome-wide single-nucleotide polymorphism (SNP) data using various methods. We recently developed multivariate genomic-relatedness-based restricted maximum likelihood (MGREML) for statistically and computationally efficient esti...

Descripción completa

Detalles Bibliográficos
Autores principales: De Vlaming, Ronald, Slob, Eric A. W., Groenen, Patrick J. F., Rietveld, Cornelius A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9327374/
https://www.ncbi.nlm.nih.gov/pubmed/35896974
http://dx.doi.org/10.1186/s12859-022-04835-3
_version_ 1784757493934587904
author De Vlaming, Ronald
Slob, Eric A. W.
Groenen, Patrick J. F.
Rietveld, Cornelius A.
author_facet De Vlaming, Ronald
Slob, Eric A. W.
Groenen, Patrick J. F.
Rietveld, Cornelius A.
author_sort De Vlaming, Ronald
collection PubMed
description BACKGROUND: Heritability and genetic correlation can be estimated from genome-wide single-nucleotide polymorphism (SNP) data using various methods. We recently developed multivariate genomic-relatedness-based restricted maximum likelihood (MGREML) for statistically and computationally efficient estimation of SNP-based heritability ([Formula: see text] ) and genetic correlation ([Formula: see text] ) across many traits in large datasets. Here, we extend MGREML by allowing it to fit and perform tests on user-specified factor models, while preserving the low computational complexity. RESULTS: Using simulations, we show that MGREML yields consistent estimates and valid inferences for such factor models at low computational cost (e.g., for data on 50 traits and 20,000 individuals, a saturated model involving 50 [Formula: see text] ’s, 1225 [Formula: see text] ’s, and 50 fixed effects is estimated and compared to a restricted model in less than one hour on a single notebook with two 2.7 GHz cores and 16 GB of RAM). Using repeated measures of height and body mass index from the US Health and Retirement Study, we illustrate the ability of MGREML to estimate a factor model and test whether it fits the data better than a nested model. The MGREML tool, the simulation code, and an extensive tutorial are freely available at https://github.com/devlaming/mgreml/. CONCLUSION: MGREML can now be used to estimate multivariate factor structures and perform inferences on such factor models at low computational cost. This new feature enables simple structural equation modeling using MGREML, allowing researchers to specify, estimate, and compare genetic factor models of their choosing using SNP data.
format Online
Article
Text
id pubmed-9327374
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93273742022-07-28 Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships De Vlaming, Ronald Slob, Eric A. W. Groenen, Patrick J. F. Rietveld, Cornelius A. BMC Bioinformatics Software BACKGROUND: Heritability and genetic correlation can be estimated from genome-wide single-nucleotide polymorphism (SNP) data using various methods. We recently developed multivariate genomic-relatedness-based restricted maximum likelihood (MGREML) for statistically and computationally efficient estimation of SNP-based heritability ([Formula: see text] ) and genetic correlation ([Formula: see text] ) across many traits in large datasets. Here, we extend MGREML by allowing it to fit and perform tests on user-specified factor models, while preserving the low computational complexity. RESULTS: Using simulations, we show that MGREML yields consistent estimates and valid inferences for such factor models at low computational cost (e.g., for data on 50 traits and 20,000 individuals, a saturated model involving 50 [Formula: see text] ’s, 1225 [Formula: see text] ’s, and 50 fixed effects is estimated and compared to a restricted model in less than one hour on a single notebook with two 2.7 GHz cores and 16 GB of RAM). Using repeated measures of height and body mass index from the US Health and Retirement Study, we illustrate the ability of MGREML to estimate a factor model and test whether it fits the data better than a nested model. The MGREML tool, the simulation code, and an extensive tutorial are freely available at https://github.com/devlaming/mgreml/. CONCLUSION: MGREML can now be used to estimate multivariate factor structures and perform inferences on such factor models at low computational cost. This new feature enables simple structural equation modeling using MGREML, allowing researchers to specify, estimate, and compare genetic factor models of their choosing using SNP data. BioMed Central 2022-07-27 /pmc/articles/PMC9327374/ /pubmed/35896974 http://dx.doi.org/10.1186/s12859-022-04835-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
De Vlaming, Ronald
Slob, Eric A. W.
Groenen, Patrick J. F.
Rietveld, Cornelius A.
Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships
title Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships
title_full Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships
title_fullStr Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships
title_full_unstemmed Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships
title_short Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships
title_sort multivariate estimation of factor structures of complex traits using snp-based genomic relationships
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9327374/
https://www.ncbi.nlm.nih.gov/pubmed/35896974
http://dx.doi.org/10.1186/s12859-022-04835-3
work_keys_str_mv AT devlamingronald multivariateestimationoffactorstructuresofcomplextraitsusingsnpbasedgenomicrelationships
AT slobericaw multivariateestimationoffactorstructuresofcomplextraitsusingsnpbasedgenomicrelationships
AT groenenpatrickjf multivariateestimationoffactorstructuresofcomplextraitsusingsnpbasedgenomicrelationships
AT rietveldcorneliusa multivariateestimationoffactorstructuresofcomplextraitsusingsnpbasedgenomicrelationships