Cargando…
Performance of penalized maximum likelihood in estimation of genetic covariances matrices
BACKGROUND: Estimation of genetic covariance matrices for multivariate problems comprising more than a few traits is inherently problematic, since sampling variation increases dramatically with the number of traits. This paper investigates the efficacy of regularized estimation of covariance compone...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3331854/ https://www.ncbi.nlm.nih.gov/pubmed/22117894 http://dx.doi.org/10.1186/1297-9686-43-39 |
_version_ | 1782230156593594368 |
---|---|
author | Meyer, Karin |
author_facet | Meyer, Karin |
author_sort | Meyer, Karin |
collection | PubMed |
description | BACKGROUND: Estimation of genetic covariance matrices for multivariate problems comprising more than a few traits is inherently problematic, since sampling variation increases dramatically with the number of traits. This paper investigates the efficacy of regularized estimation of covariance components in a maximum likelihood framework, imposing a penalty on the likelihood designed to reduce sampling variation. In particular, penalties that "borrow strength" from the phenotypic covariance matrix are considered. METHODS: An extensive simulation study was carried out to investigate the reduction in average 'loss', i.e. the deviation in estimated matrices from the population values, and the accompanying bias for a range of parameter values and sample sizes. A number of penalties are examined, penalizing either the canonical eigenvalues or the genetic covariance or correlation matrices. In addition, several strategies to determine the amount of penalization to be applied, i.e. to estimate the appropriate tuning factor, are explored. RESULTS: It is shown that substantial reductions in loss for estimates of genetic covariance can be achieved for small to moderate sample sizes. While no penalty performed best overall, penalizing the variance among the estimated canonical eigenvalues on the logarithmic scale or shrinking the genetic towards the phenotypic correlation matrix appeared most advantageous. Estimating the tuning factor using cross-validation resulted in a loss reduction 10 to 15% less than that obtained if population values were known. Applying a mild penalty, chosen so that the deviation in likelihood from the maximum was non-significant, performed as well if not better than cross-validation and can be recommended as a pragmatic strategy. CONCLUSIONS: Penalized maximum likelihood estimation provides the means to 'make the most' of limited and precious data and facilitates more stable estimation for multi-dimensional analyses. It should become part of our everyday toolkit for multivariate estimation in quantitative genetics. |
format | Online Article Text |
id | pubmed-3331854 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-33318542012-04-23 Performance of penalized maximum likelihood in estimation of genetic covariances matrices Meyer, Karin Genet Sel Evol Research BACKGROUND: Estimation of genetic covariance matrices for multivariate problems comprising more than a few traits is inherently problematic, since sampling variation increases dramatically with the number of traits. This paper investigates the efficacy of regularized estimation of covariance components in a maximum likelihood framework, imposing a penalty on the likelihood designed to reduce sampling variation. In particular, penalties that "borrow strength" from the phenotypic covariance matrix are considered. METHODS: An extensive simulation study was carried out to investigate the reduction in average 'loss', i.e. the deviation in estimated matrices from the population values, and the accompanying bias for a range of parameter values and sample sizes. A number of penalties are examined, penalizing either the canonical eigenvalues or the genetic covariance or correlation matrices. In addition, several strategies to determine the amount of penalization to be applied, i.e. to estimate the appropriate tuning factor, are explored. RESULTS: It is shown that substantial reductions in loss for estimates of genetic covariance can be achieved for small to moderate sample sizes. While no penalty performed best overall, penalizing the variance among the estimated canonical eigenvalues on the logarithmic scale or shrinking the genetic towards the phenotypic correlation matrix appeared most advantageous. Estimating the tuning factor using cross-validation resulted in a loss reduction 10 to 15% less than that obtained if population values were known. Applying a mild penalty, chosen so that the deviation in likelihood from the maximum was non-significant, performed as well if not better than cross-validation and can be recommended as a pragmatic strategy. CONCLUSIONS: Penalized maximum likelihood estimation provides the means to 'make the most' of limited and precious data and facilitates more stable estimation for multi-dimensional analyses. It should become part of our everyday toolkit for multivariate estimation in quantitative genetics. BioMed Central 2011-11-27 /pmc/articles/PMC3331854/ /pubmed/22117894 http://dx.doi.org/10.1186/1297-9686-43-39 Text en Copyright ©2011 Meyer; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Meyer, Karin Performance of penalized maximum likelihood in estimation of genetic covariances matrices |
title | Performance of penalized maximum likelihood in estimation of genetic covariances matrices |
title_full | Performance of penalized maximum likelihood in estimation of genetic covariances matrices |
title_fullStr | Performance of penalized maximum likelihood in estimation of genetic covariances matrices |
title_full_unstemmed | Performance of penalized maximum likelihood in estimation of genetic covariances matrices |
title_short | Performance of penalized maximum likelihood in estimation of genetic covariances matrices |
title_sort | performance of penalized maximum likelihood in estimation of genetic covariances matrices |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3331854/ https://www.ncbi.nlm.nih.gov/pubmed/22117894 http://dx.doi.org/10.1186/1297-9686-43-39 |
work_keys_str_mv | AT meyerkarin performanceofpenalizedmaximumlikelihoodinestimationofgeneticcovariancesmatrices |