Cargando…

Understanding the potential bias of variance components estimators when using genomic models

BACKGROUND: Genomic models that link phenotypes to dense genotype information are increasingly being used for infering variance parameters in genetics studies. The variance parameters of these models can be inferred using restricted maximum likelihood, which produces consistent, asymptotically norma...

Descripción completa

Detalles Bibliográficos
Autores principales: Cuyabano, Beatriz C. D., Sørensen, A. Christian, Sørensen, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6080367/
https://www.ncbi.nlm.nih.gov/pubmed/30081816
http://dx.doi.org/10.1186/s12711-018-0411-0
_version_ 1783345460329578496
author Cuyabano, Beatriz C. D.
Sørensen, A. Christian
Sørensen, Peter
author_facet Cuyabano, Beatriz C. D.
Sørensen, A. Christian
Sørensen, Peter
author_sort Cuyabano, Beatriz C. D.
collection PubMed
description BACKGROUND: Genomic models that link phenotypes to dense genotype information are increasingly being used for infering variance parameters in genetics studies. The variance parameters of these models can be inferred using restricted maximum likelihood, which produces consistent, asymptotically normal estimates of variance components under the true model. These properties are not guaranteed to hold when the covariance structure of the data specified by the genomic model differs substantially from the covariance structure specified by the true model, and in this case, the likelihood of the model is said to be misspecified. If the covariance structure specified by the genomic model provides a poor description of that specified by the true model, the likelihood misspecification may lead to incorrect inferences. RESULTS: This work provides a theoretical analysis of the genomic models based on splitting the misspecified likelihood equations into components, which isolate those that contribute to incorrect inferences, providing an informative measure, defined as [Formula: see text] , to compare the covariance structure of the data specified by the genomic and the true models. This comparison of the covariance structures allows us to determine whether or not bias in the variance components estimates is expected to occur. CONCLUSIONS: The theory presented can be used to provide an explanation for the success of a number of recently reported approaches that are suggested to remove sources of bias of heritability estimates. Furthermore, however complex is the quantification of this bias, we can determine that, in genomic models that consider a single genomic component to estimate heritability (assuming SNP effects are all i.i.d.), the bias of the estimator tends to be downward, when it exists. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-018-0411-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6080367
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60803672018-08-09 Understanding the potential bias of variance components estimators when using genomic models Cuyabano, Beatriz C. D. Sørensen, A. Christian Sørensen, Peter Genet Sel Evol Research Article BACKGROUND: Genomic models that link phenotypes to dense genotype information are increasingly being used for infering variance parameters in genetics studies. The variance parameters of these models can be inferred using restricted maximum likelihood, which produces consistent, asymptotically normal estimates of variance components under the true model. These properties are not guaranteed to hold when the covariance structure of the data specified by the genomic model differs substantially from the covariance structure specified by the true model, and in this case, the likelihood of the model is said to be misspecified. If the covariance structure specified by the genomic model provides a poor description of that specified by the true model, the likelihood misspecification may lead to incorrect inferences. RESULTS: This work provides a theoretical analysis of the genomic models based on splitting the misspecified likelihood equations into components, which isolate those that contribute to incorrect inferences, providing an informative measure, defined as [Formula: see text] , to compare the covariance structure of the data specified by the genomic and the true models. This comparison of the covariance structures allows us to determine whether or not bias in the variance components estimates is expected to occur. CONCLUSIONS: The theory presented can be used to provide an explanation for the success of a number of recently reported approaches that are suggested to remove sources of bias of heritability estimates. Furthermore, however complex is the quantification of this bias, we can determine that, in genomic models that consider a single genomic component to estimate heritability (assuming SNP effects are all i.i.d.), the bias of the estimator tends to be downward, when it exists. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-018-0411-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-06 /pmc/articles/PMC6080367/ /pubmed/30081816 http://dx.doi.org/10.1186/s12711-018-0411-0 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Cuyabano, Beatriz C. D.
Sørensen, A. Christian
Sørensen, Peter
Understanding the potential bias of variance components estimators when using genomic models
title Understanding the potential bias of variance components estimators when using genomic models
title_full Understanding the potential bias of variance components estimators when using genomic models
title_fullStr Understanding the potential bias of variance components estimators when using genomic models
title_full_unstemmed Understanding the potential bias of variance components estimators when using genomic models
title_short Understanding the potential bias of variance components estimators when using genomic models
title_sort understanding the potential bias of variance components estimators when using genomic models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6080367/
https://www.ncbi.nlm.nih.gov/pubmed/30081816
http://dx.doi.org/10.1186/s12711-018-0411-0
work_keys_str_mv AT cuyabanobeatrizcd understandingthepotentialbiasofvariancecomponentsestimatorswhenusinggenomicmodels
AT sørensenachristian understandingthepotentialbiasofvariancecomponentsestimatorswhenusinggenomicmodels
AT sørensenpeter understandingthepotentialbiasofvariancecomponentsestimatorswhenusinggenomicmodels