Cargando…

Subsampling Technique to Estimate Variance Component for UK-Biobank Traits

The estimation of heritability has been an important question in statistical genetics. Due to the clear mathematical properties, the modified Haseman–Elston regression has been found a bridge that connects and develops various parallel heritability estimation methods. With the increasing sample size...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Ting, Qi, Guo-An, Zhu, Jun, Xu, Hai-Ming, Chen, Guo-Bo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7978110/
https://www.ncbi.nlm.nih.gov/pubmed/33747041
http://dx.doi.org/10.3389/fgene.2021.612045
_version_ 1783667155772899328
author Xu, Ting
Qi, Guo-An
Zhu, Jun
Xu, Hai-Ming
Chen, Guo-Bo
author_facet Xu, Ting
Qi, Guo-An
Zhu, Jun
Xu, Hai-Ming
Chen, Guo-Bo
author_sort Xu, Ting
collection PubMed
description The estimation of heritability has been an important question in statistical genetics. Due to the clear mathematical properties, the modified Haseman–Elston regression has been found a bridge that connects and develops various parallel heritability estimation methods. With the increasing sample size, estimating heritability for biobank-scale data poses a challenge for statistical computation, in particular that the calculation of the genetic relationship matrix is a huge challenge in statistical computation. Using the Haseman–Elston framework, in this study we explicitly analyzed the mathematical structure of the key term tr(K(T)K), the trace of high-order term of the genetic relationship matrix, a component involved in the estimation procedure. In this study, we proposed two estimators, which can estimate tr(K(T)K) with greatly reduced sampling variance compared to the existing method under the same computational complexity. We applied this method to 81 traits in UK Biobank data and compared the chromosome-wise partition heritability with the whole-genome heritability, also as an approach for testing polygenicity.
format Online
Article
Text
id pubmed-7978110
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79781102021-03-20 Subsampling Technique to Estimate Variance Component for UK-Biobank Traits Xu, Ting Qi, Guo-An Zhu, Jun Xu, Hai-Ming Chen, Guo-Bo Front Genet Genetics The estimation of heritability has been an important question in statistical genetics. Due to the clear mathematical properties, the modified Haseman–Elston regression has been found a bridge that connects and develops various parallel heritability estimation methods. With the increasing sample size, estimating heritability for biobank-scale data poses a challenge for statistical computation, in particular that the calculation of the genetic relationship matrix is a huge challenge in statistical computation. Using the Haseman–Elston framework, in this study we explicitly analyzed the mathematical structure of the key term tr(K(T)K), the trace of high-order term of the genetic relationship matrix, a component involved in the estimation procedure. In this study, we proposed two estimators, which can estimate tr(K(T)K) with greatly reduced sampling variance compared to the existing method under the same computational complexity. We applied this method to 81 traits in UK Biobank data and compared the chromosome-wise partition heritability with the whole-genome heritability, also as an approach for testing polygenicity. Frontiers Media S.A. 2021-03-05 /pmc/articles/PMC7978110/ /pubmed/33747041 http://dx.doi.org/10.3389/fgene.2021.612045 Text en Copyright © 2021 Xu, Qi, Zhu, Xu and Chen. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xu, Ting
Qi, Guo-An
Zhu, Jun
Xu, Hai-Ming
Chen, Guo-Bo
Subsampling Technique to Estimate Variance Component for UK-Biobank Traits
title Subsampling Technique to Estimate Variance Component for UK-Biobank Traits
title_full Subsampling Technique to Estimate Variance Component for UK-Biobank Traits
title_fullStr Subsampling Technique to Estimate Variance Component for UK-Biobank Traits
title_full_unstemmed Subsampling Technique to Estimate Variance Component for UK-Biobank Traits
title_short Subsampling Technique to Estimate Variance Component for UK-Biobank Traits
title_sort subsampling technique to estimate variance component for uk-biobank traits
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7978110/
https://www.ncbi.nlm.nih.gov/pubmed/33747041
http://dx.doi.org/10.3389/fgene.2021.612045
work_keys_str_mv AT xuting subsamplingtechniquetoestimatevariancecomponentforukbiobanktraits
AT qiguoan subsamplingtechniquetoestimatevariancecomponentforukbiobanktraits
AT zhujun subsamplingtechniquetoestimatevariancecomponentforukbiobanktraits
AT xuhaiming subsamplingtechniquetoestimatevariancecomponentforukbiobanktraits
AT chenguobo subsamplingtechniquetoestimatevariancecomponentforukbiobanktraits