Cargando…

Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome

The popular Genome-wide Complex Trait Analysis (GCTA) software uses the random-effects models for estimating the narrow-sense heritability based on GWAS data of unrelated individuals without knowing and identifying the causal loci. Many methods have since extended this approach to various situations...

Descripción completa

Detalles Bibliográficos
Autores principales: Gorfine, Malka, Berndt, Sonja I., Chang-Claude, Jenny, Hoffmeister, Michael, Le Marchand, Loic, Potter, John, Slattery, Martha L., Keret, Nir, Peters, Ulrike, Hsu, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5559077/
https://www.ncbi.nlm.nih.gov/pubmed/28813438
http://dx.doi.org/10.1371/journal.pone.0181269
_version_ 1783257484048203776
author Gorfine, Malka
Berndt, Sonja I.
Chang-Claude, Jenny
Hoffmeister, Michael
Le Marchand, Loic
Potter, John
Slattery, Martha L.
Keret, Nir
Peters, Ulrike
Hsu, Li
author_facet Gorfine, Malka
Berndt, Sonja I.
Chang-Claude, Jenny
Hoffmeister, Michael
Le Marchand, Loic
Potter, John
Slattery, Martha L.
Keret, Nir
Peters, Ulrike
Hsu, Li
author_sort Gorfine, Malka
collection PubMed
description The popular Genome-wide Complex Trait Analysis (GCTA) software uses the random-effects models for estimating the narrow-sense heritability based on GWAS data of unrelated individuals without knowing and identifying the causal loci. Many methods have since extended this approach to various situations. However, since the proportion of causal loci among the variants is typically very small and GCTA uses all variants to calculate the similarities among individuals, the estimation of heritability may be unstable, resulting in a large variance of the estimates. Moreover, if the causal SNPs are not genotyped, GCTA sometimes greatly underestimates the true heritability. We present a novel narrow-sense heritability estimator, named HERRA, using well-developed ultra-high dimensional machine-learning methods, applicable to continuous or dichotomous outcomes, as other existing methods. Additionally, HERRA is applicable to time-to-event or age-at-onset outcome, which, to our knowledge, no existing method can handle. Compared to GCTA and LDAK for continuous and binary outcomes, HERRA often has a smaller variance, and when causal SNPs are not genotyped, HERRA has a much smaller empirical bias. We applied GCTA, LDAK and HERRA to a large colorectal cancer dataset using dichotomous outcome (4,312 cases, 4,356 controls, genotyped using Illumina 300K), the respective heritability estimates of GCTA, LDAK and HERRA are 0.068 (SE = 0.017), 0.072 (SE = 0.021) and 0.110 (SE = 5.19 x 10(−3)). HERRA yields over 50% increase in heritability estimate compared to GCTA or LDAK.
format Online
Article
Text
id pubmed-5559077
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55590772017-08-25 Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome Gorfine, Malka Berndt, Sonja I. Chang-Claude, Jenny Hoffmeister, Michael Le Marchand, Loic Potter, John Slattery, Martha L. Keret, Nir Peters, Ulrike Hsu, Li PLoS One Research Article The popular Genome-wide Complex Trait Analysis (GCTA) software uses the random-effects models for estimating the narrow-sense heritability based on GWAS data of unrelated individuals without knowing and identifying the causal loci. Many methods have since extended this approach to various situations. However, since the proportion of causal loci among the variants is typically very small and GCTA uses all variants to calculate the similarities among individuals, the estimation of heritability may be unstable, resulting in a large variance of the estimates. Moreover, if the causal SNPs are not genotyped, GCTA sometimes greatly underestimates the true heritability. We present a novel narrow-sense heritability estimator, named HERRA, using well-developed ultra-high dimensional machine-learning methods, applicable to continuous or dichotomous outcomes, as other existing methods. Additionally, HERRA is applicable to time-to-event or age-at-onset outcome, which, to our knowledge, no existing method can handle. Compared to GCTA and LDAK for continuous and binary outcomes, HERRA often has a smaller variance, and when causal SNPs are not genotyped, HERRA has a much smaller empirical bias. We applied GCTA, LDAK and HERRA to a large colorectal cancer dataset using dichotomous outcome (4,312 cases, 4,356 controls, genotyped using Illumina 300K), the respective heritability estimates of GCTA, LDAK and HERRA are 0.068 (SE = 0.017), 0.072 (SE = 0.021) and 0.110 (SE = 5.19 x 10(−3)). HERRA yields over 50% increase in heritability estimate compared to GCTA or LDAK. Public Library of Science 2017-08-16 /pmc/articles/PMC5559077/ /pubmed/28813438 http://dx.doi.org/10.1371/journal.pone.0181269 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Gorfine, Malka
Berndt, Sonja I.
Chang-Claude, Jenny
Hoffmeister, Michael
Le Marchand, Loic
Potter, John
Slattery, Martha L.
Keret, Nir
Peters, Ulrike
Hsu, Li
Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome
title Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome
title_full Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome
title_fullStr Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome
title_full_unstemmed Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome
title_short Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome
title_sort heritability estimation using a regularized regression approach (herra): applicable to continuous, dichotomous or age-at-onset outcome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5559077/
https://www.ncbi.nlm.nih.gov/pubmed/28813438
http://dx.doi.org/10.1371/journal.pone.0181269
work_keys_str_mv AT gorfinemalka heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome
AT berndtsonjai heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome
AT changclaudejenny heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome
AT hoffmeistermichael heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome
AT lemarchandloic heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome
AT potterjohn heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome
AT slatterymarthal heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome
AT keretnir heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome
AT petersulrike heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome
AT hsuli heritabilityestimationusingaregularizedregressionapproachherraapplicabletocontinuousdichotomousorageatonsetoutcome