Cargando…

Performance of penalized maximum likelihood in estimation of genetic covariances matrices

BACKGROUND: Estimation of genetic covariance matrices for multivariate problems comprising more than a few traits is inherently problematic, since sampling variation increases dramatically with the number of traits. This paper investigates the efficacy of regularized estimation of covariance compone...

Descripción completa

Detalles Bibliográficos
Autor principal: Meyer, Karin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3331854/
https://www.ncbi.nlm.nih.gov/pubmed/22117894
http://dx.doi.org/10.1186/1297-9686-43-39
_version_ 1782230156593594368
author Meyer, Karin
author_facet Meyer, Karin
author_sort Meyer, Karin
collection PubMed
description BACKGROUND: Estimation of genetic covariance matrices for multivariate problems comprising more than a few traits is inherently problematic, since sampling variation increases dramatically with the number of traits. This paper investigates the efficacy of regularized estimation of covariance components in a maximum likelihood framework, imposing a penalty on the likelihood designed to reduce sampling variation. In particular, penalties that "borrow strength" from the phenotypic covariance matrix are considered. METHODS: An extensive simulation study was carried out to investigate the reduction in average 'loss', i.e. the deviation in estimated matrices from the population values, and the accompanying bias for a range of parameter values and sample sizes. A number of penalties are examined, penalizing either the canonical eigenvalues or the genetic covariance or correlation matrices. In addition, several strategies to determine the amount of penalization to be applied, i.e. to estimate the appropriate tuning factor, are explored. RESULTS: It is shown that substantial reductions in loss for estimates of genetic covariance can be achieved for small to moderate sample sizes. While no penalty performed best overall, penalizing the variance among the estimated canonical eigenvalues on the logarithmic scale or shrinking the genetic towards the phenotypic correlation matrix appeared most advantageous. Estimating the tuning factor using cross-validation resulted in a loss reduction 10 to 15% less than that obtained if population values were known. Applying a mild penalty, chosen so that the deviation in likelihood from the maximum was non-significant, performed as well if not better than cross-validation and can be recommended as a pragmatic strategy. CONCLUSIONS: Penalized maximum likelihood estimation provides the means to 'make the most' of limited and precious data and facilitates more stable estimation for multi-dimensional analyses. It should become part of our everyday toolkit for multivariate estimation in quantitative genetics.
format Online
Article
Text
id pubmed-3331854
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33318542012-04-23 Performance of penalized maximum likelihood in estimation of genetic covariances matrices Meyer, Karin Genet Sel Evol Research BACKGROUND: Estimation of genetic covariance matrices for multivariate problems comprising more than a few traits is inherently problematic, since sampling variation increases dramatically with the number of traits. This paper investigates the efficacy of regularized estimation of covariance components in a maximum likelihood framework, imposing a penalty on the likelihood designed to reduce sampling variation. In particular, penalties that "borrow strength" from the phenotypic covariance matrix are considered. METHODS: An extensive simulation study was carried out to investigate the reduction in average 'loss', i.e. the deviation in estimated matrices from the population values, and the accompanying bias for a range of parameter values and sample sizes. A number of penalties are examined, penalizing either the canonical eigenvalues or the genetic covariance or correlation matrices. In addition, several strategies to determine the amount of penalization to be applied, i.e. to estimate the appropriate tuning factor, are explored. RESULTS: It is shown that substantial reductions in loss for estimates of genetic covariance can be achieved for small to moderate sample sizes. While no penalty performed best overall, penalizing the variance among the estimated canonical eigenvalues on the logarithmic scale or shrinking the genetic towards the phenotypic correlation matrix appeared most advantageous. Estimating the tuning factor using cross-validation resulted in a loss reduction 10 to 15% less than that obtained if population values were known. Applying a mild penalty, chosen so that the deviation in likelihood from the maximum was non-significant, performed as well if not better than cross-validation and can be recommended as a pragmatic strategy. CONCLUSIONS: Penalized maximum likelihood estimation provides the means to 'make the most' of limited and precious data and facilitates more stable estimation for multi-dimensional analyses. It should become part of our everyday toolkit for multivariate estimation in quantitative genetics. BioMed Central 2011-11-27 /pmc/articles/PMC3331854/ /pubmed/22117894 http://dx.doi.org/10.1186/1297-9686-43-39 Text en Copyright ©2011 Meyer; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Meyer, Karin
Performance of penalized maximum likelihood in estimation of genetic covariances matrices
title Performance of penalized maximum likelihood in estimation of genetic covariances matrices
title_full Performance of penalized maximum likelihood in estimation of genetic covariances matrices
title_fullStr Performance of penalized maximum likelihood in estimation of genetic covariances matrices
title_full_unstemmed Performance of penalized maximum likelihood in estimation of genetic covariances matrices
title_short Performance of penalized maximum likelihood in estimation of genetic covariances matrices
title_sort performance of penalized maximum likelihood in estimation of genetic covariances matrices
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3331854/
https://www.ncbi.nlm.nih.gov/pubmed/22117894
http://dx.doi.org/10.1186/1297-9686-43-39
work_keys_str_mv AT meyerkarin performanceofpenalizedmaximumlikelihoodinestimationofgeneticcovariancesmatrices