Cargando…
Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships
BACKGROUND: Heritability and genetic correlation can be estimated from genome-wide single-nucleotide polymorphism (SNP) data using various methods. We recently developed multivariate genomic-relatedness-based restricted maximum likelihood (MGREML) for statistically and computationally efficient esti...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9327374/ https://www.ncbi.nlm.nih.gov/pubmed/35896974 http://dx.doi.org/10.1186/s12859-022-04835-3 |
_version_ | 1784757493934587904 |
---|---|
author | De Vlaming, Ronald Slob, Eric A. W. Groenen, Patrick J. F. Rietveld, Cornelius A. |
author_facet | De Vlaming, Ronald Slob, Eric A. W. Groenen, Patrick J. F. Rietveld, Cornelius A. |
author_sort | De Vlaming, Ronald |
collection | PubMed |
description | BACKGROUND: Heritability and genetic correlation can be estimated from genome-wide single-nucleotide polymorphism (SNP) data using various methods. We recently developed multivariate genomic-relatedness-based restricted maximum likelihood (MGREML) for statistically and computationally efficient estimation of SNP-based heritability ([Formula: see text] ) and genetic correlation ([Formula: see text] ) across many traits in large datasets. Here, we extend MGREML by allowing it to fit and perform tests on user-specified factor models, while preserving the low computational complexity. RESULTS: Using simulations, we show that MGREML yields consistent estimates and valid inferences for such factor models at low computational cost (e.g., for data on 50 traits and 20,000 individuals, a saturated model involving 50 [Formula: see text] ’s, 1225 [Formula: see text] ’s, and 50 fixed effects is estimated and compared to a restricted model in less than one hour on a single notebook with two 2.7 GHz cores and 16 GB of RAM). Using repeated measures of height and body mass index from the US Health and Retirement Study, we illustrate the ability of MGREML to estimate a factor model and test whether it fits the data better than a nested model. The MGREML tool, the simulation code, and an extensive tutorial are freely available at https://github.com/devlaming/mgreml/. CONCLUSION: MGREML can now be used to estimate multivariate factor structures and perform inferences on such factor models at low computational cost. This new feature enables simple structural equation modeling using MGREML, allowing researchers to specify, estimate, and compare genetic factor models of their choosing using SNP data. |
format | Online Article Text |
id | pubmed-9327374 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-93273742022-07-28 Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships De Vlaming, Ronald Slob, Eric A. W. Groenen, Patrick J. F. Rietveld, Cornelius A. BMC Bioinformatics Software BACKGROUND: Heritability and genetic correlation can be estimated from genome-wide single-nucleotide polymorphism (SNP) data using various methods. We recently developed multivariate genomic-relatedness-based restricted maximum likelihood (MGREML) for statistically and computationally efficient estimation of SNP-based heritability ([Formula: see text] ) and genetic correlation ([Formula: see text] ) across many traits in large datasets. Here, we extend MGREML by allowing it to fit and perform tests on user-specified factor models, while preserving the low computational complexity. RESULTS: Using simulations, we show that MGREML yields consistent estimates and valid inferences for such factor models at low computational cost (e.g., for data on 50 traits and 20,000 individuals, a saturated model involving 50 [Formula: see text] ’s, 1225 [Formula: see text] ’s, and 50 fixed effects is estimated and compared to a restricted model in less than one hour on a single notebook with two 2.7 GHz cores and 16 GB of RAM). Using repeated measures of height and body mass index from the US Health and Retirement Study, we illustrate the ability of MGREML to estimate a factor model and test whether it fits the data better than a nested model. The MGREML tool, the simulation code, and an extensive tutorial are freely available at https://github.com/devlaming/mgreml/. CONCLUSION: MGREML can now be used to estimate multivariate factor structures and perform inferences on such factor models at low computational cost. This new feature enables simple structural equation modeling using MGREML, allowing researchers to specify, estimate, and compare genetic factor models of their choosing using SNP data. BioMed Central 2022-07-27 /pmc/articles/PMC9327374/ /pubmed/35896974 http://dx.doi.org/10.1186/s12859-022-04835-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software De Vlaming, Ronald Slob, Eric A. W. Groenen, Patrick J. F. Rietveld, Cornelius A. Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships |
title | Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships |
title_full | Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships |
title_fullStr | Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships |
title_full_unstemmed | Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships |
title_short | Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships |
title_sort | multivariate estimation of factor structures of complex traits using snp-based genomic relationships |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9327374/ https://www.ncbi.nlm.nih.gov/pubmed/35896974 http://dx.doi.org/10.1186/s12859-022-04835-3 |
work_keys_str_mv | AT devlamingronald multivariateestimationoffactorstructuresofcomplextraitsusingsnpbasedgenomicrelationships AT slobericaw multivariateestimationoffactorstructuresofcomplextraitsusingsnpbasedgenomicrelationships AT groenenpatrickjf multivariateestimationoffactorstructuresofcomplextraitsusingsnpbasedgenomicrelationships AT rietveldcorneliusa multivariateestimationoffactorstructuresofcomplextraitsusingsnpbasedgenomicrelationships |