Cargando…

Copula miss-specification in REML multivariate genetic animal model estimation

BACKGROUND: In animal genetics, linear mixed models are used to deal with genetic and environmental effects. The variance and covariance terms of these models are usually estimated by restricted maximum likelihood (REML), which provides unbiased estimators. A strong hypothesis of REML estimation is...

Descripción completa

Detalles Bibliográficos
Autores principales: Rohmer, Tom, Ricard, Anne, David, Ingrid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9137146/
https://www.ncbi.nlm.nih.gov/pubmed/35619063
http://dx.doi.org/10.1186/s12711-022-00729-3
_version_ 1784714316047450112
author Rohmer, Tom
Ricard, Anne
David, Ingrid
author_facet Rohmer, Tom
Ricard, Anne
David, Ingrid
author_sort Rohmer, Tom
collection PubMed
description BACKGROUND: In animal genetics, linear mixed models are used to deal with genetic and environmental effects. The variance and covariance terms of these models are usually estimated by restricted maximum likelihood (REML), which provides unbiased estimators. A strong hypothesis of REML estimation is the multi-normality of the response variables. However, in practice, even if the marginal distributions of each phenotype are normal, the multi-normality assumption may be violated by non-normality of the cross-sectional dependence structure, that is to say when the copula of the multivariate distribution is not Gaussian. This study uses simulations to evaluate the impact of copula miss-specification in a bivariate animal model on REML estimations of variance components. RESULT: Bivariate phenotypes were simulated for populations undergoing selection, considering different copulas for the dependence structure between the error components. Two multi-trait situations were considered: two phenotypes were measured on the selection candidates, or only one phenotype was measured on the selection candidates. Three generations with random selection and five generations with truncation selection based on estimated breeding values were simulated. When selection was performed at random, no significant differences were observed between the REML estimations of variance components and the true parameters even for the non-Gaussian distributions. For the truncation selections, when two phenotypes were measured on candidates, biases were systematically observed in the variance components for high residual dependence in the case of non-Gaussian distributions, especially in the case of a heavy-tailed or asymmetric distribution when the two traits were measured. Conversely, when only one phenotype was measured on candidates, no difference was observed between the Gaussian and non-Gaussian distributions in REML estimations. CONCLUSIONS: This study confirms that REML can be used by geneticists to evaluate breeding values in the multivariate case even if the multivariate phenotypes deviate from normality in the situation of random selection or if one trait is not measured for the candidate under selection. Nevertheless, when the two traits are measured, the violation of the normality assumption may lead to non-negligible biases in the REML estimations of the variance-covariance components. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-022-00729-3.
format Online
Article
Text
id pubmed-9137146
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-91371462022-05-28 Copula miss-specification in REML multivariate genetic animal model estimation Rohmer, Tom Ricard, Anne David, Ingrid Genet Sel Evol Research Article BACKGROUND: In animal genetics, linear mixed models are used to deal with genetic and environmental effects. The variance and covariance terms of these models are usually estimated by restricted maximum likelihood (REML), which provides unbiased estimators. A strong hypothesis of REML estimation is the multi-normality of the response variables. However, in practice, even if the marginal distributions of each phenotype are normal, the multi-normality assumption may be violated by non-normality of the cross-sectional dependence structure, that is to say when the copula of the multivariate distribution is not Gaussian. This study uses simulations to evaluate the impact of copula miss-specification in a bivariate animal model on REML estimations of variance components. RESULT: Bivariate phenotypes were simulated for populations undergoing selection, considering different copulas for the dependence structure between the error components. Two multi-trait situations were considered: two phenotypes were measured on the selection candidates, or only one phenotype was measured on the selection candidates. Three generations with random selection and five generations with truncation selection based on estimated breeding values were simulated. When selection was performed at random, no significant differences were observed between the REML estimations of variance components and the true parameters even for the non-Gaussian distributions. For the truncation selections, when two phenotypes were measured on candidates, biases were systematically observed in the variance components for high residual dependence in the case of non-Gaussian distributions, especially in the case of a heavy-tailed or asymmetric distribution when the two traits were measured. Conversely, when only one phenotype was measured on candidates, no difference was observed between the Gaussian and non-Gaussian distributions in REML estimations. CONCLUSIONS: This study confirms that REML can be used by geneticists to evaluate breeding values in the multivariate case even if the multivariate phenotypes deviate from normality in the situation of random selection or if one trait is not measured for the candidate under selection. Nevertheless, when the two traits are measured, the violation of the normality assumption may lead to non-negligible biases in the REML estimations of the variance-covariance components. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-022-00729-3. BioMed Central 2022-05-26 /pmc/articles/PMC9137146/ /pubmed/35619063 http://dx.doi.org/10.1186/s12711-022-00729-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Rohmer, Tom
Ricard, Anne
David, Ingrid
Copula miss-specification in REML multivariate genetic animal model estimation
title Copula miss-specification in REML multivariate genetic animal model estimation
title_full Copula miss-specification in REML multivariate genetic animal model estimation
title_fullStr Copula miss-specification in REML multivariate genetic animal model estimation
title_full_unstemmed Copula miss-specification in REML multivariate genetic animal model estimation
title_short Copula miss-specification in REML multivariate genetic animal model estimation
title_sort copula miss-specification in reml multivariate genetic animal model estimation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9137146/
https://www.ncbi.nlm.nih.gov/pubmed/35619063
http://dx.doi.org/10.1186/s12711-022-00729-3
work_keys_str_mv AT rohmertom copulamissspecificationinremlmultivariategeneticanimalmodelestimation
AT ricardanne copulamissspecificationinremlmultivariategeneticanimalmodelestimation
AT davidingrid copulamissspecificationinremlmultivariategeneticanimalmodelestimation