Cargando…

Refining multivariate disease phenotypes for high chip heritability

BACKGROUND: Statistical genetics shows that the success of both genetic association studies and genomic prediction methods is positively associated with the heritability of the trait used in the analysis. Identifying highly heritable components of a complex disease can thus enhance genetic studies o...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Jiangwen, Kranzler, Henry R, Bi, Jinbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4582350/
https://www.ncbi.nlm.nih.gov/pubmed/26399736
http://dx.doi.org/10.1186/1755-8794-8-S3-S3
_version_ 1782391689032237056
author Sun, Jiangwen
Kranzler, Henry R
Bi, Jinbo
author_facet Sun, Jiangwen
Kranzler, Henry R
Bi, Jinbo
author_sort Sun, Jiangwen
collection PubMed
description BACKGROUND: Statistical genetics shows that the success of both genetic association studies and genomic prediction methods is positively associated with the heritability of the trait used in the analysis. Identifying highly heritable components of a complex disease can thus enhance genetic studies of the disease. Existing heritable component analysis methods use data from related individuals to compute linearly-combined traits to maximize heritability. Recent advances in acquiring genome-wide markers have enhanced heritability estimation using genotypic data from apparently unrelated individuals, which is referred to as the chip heritability. Novel statistical models are thus needed to identify disease components (subtypes) with high chip heritability. METHODS: We propose an optimization approach to identify highly heritable components of a complex disease as a function of multiple clinical variables. The heritability of the components is estimated directly from unrelated individuals using their genome-wide single nucleotide polymorphisms. The proposed approach can also model the fixed effects due to covariates, such as age and race, so that the derived traits have high chip heritability after correcting for fixed effects. A new sequential quadratic programming algorithm is developed to efficiently solve the proposed optimization problem. RESULTS: The proposed algorithm was validated both in simulations and the analysis of a real-world dataset that was aggregated from genetic studies of cocaine, opoid, and alcohol dependence. Simulation studies demonstrated that the proposed approach could identify the hypothesized component from multiple synthesized features. A case study on cocaine dependence (CD) identified a quantitative trait that achieved chip heritability of 0.86 estimated using a cross-validation process. This quantitative trait corresponded to the likelihood of an individual's membership in a CD subtype. Clinical analysis showed that the subtype enclosed individuals who reported heavy use of cocaine but few withdrawal symptoms. CONCLUSIONS: Extensive experiments on both synthetic and real-world data demonstrate the effectiveness of the proposed approach as a means to find meaningful disease components with high chip heritability.
format Online
Article
Text
id pubmed-4582350
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45823502015-09-28 Refining multivariate disease phenotypes for high chip heritability Sun, Jiangwen Kranzler, Henry R Bi, Jinbo BMC Med Genomics Research BACKGROUND: Statistical genetics shows that the success of both genetic association studies and genomic prediction methods is positively associated with the heritability of the trait used in the analysis. Identifying highly heritable components of a complex disease can thus enhance genetic studies of the disease. Existing heritable component analysis methods use data from related individuals to compute linearly-combined traits to maximize heritability. Recent advances in acquiring genome-wide markers have enhanced heritability estimation using genotypic data from apparently unrelated individuals, which is referred to as the chip heritability. Novel statistical models are thus needed to identify disease components (subtypes) with high chip heritability. METHODS: We propose an optimization approach to identify highly heritable components of a complex disease as a function of multiple clinical variables. The heritability of the components is estimated directly from unrelated individuals using their genome-wide single nucleotide polymorphisms. The proposed approach can also model the fixed effects due to covariates, such as age and race, so that the derived traits have high chip heritability after correcting for fixed effects. A new sequential quadratic programming algorithm is developed to efficiently solve the proposed optimization problem. RESULTS: The proposed algorithm was validated both in simulations and the analysis of a real-world dataset that was aggregated from genetic studies of cocaine, opoid, and alcohol dependence. Simulation studies demonstrated that the proposed approach could identify the hypothesized component from multiple synthesized features. A case study on cocaine dependence (CD) identified a quantitative trait that achieved chip heritability of 0.86 estimated using a cross-validation process. This quantitative trait corresponded to the likelihood of an individual's membership in a CD subtype. Clinical analysis showed that the subtype enclosed individuals who reported heavy use of cocaine but few withdrawal symptoms. CONCLUSIONS: Extensive experiments on both synthetic and real-world data demonstrate the effectiveness of the proposed approach as a means to find meaningful disease components with high chip heritability. BioMed Central 2015-09-23 /pmc/articles/PMC4582350/ /pubmed/26399736 http://dx.doi.org/10.1186/1755-8794-8-S3-S3 Text en Copyright © 2015 Sun et al.; http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Sun, Jiangwen
Kranzler, Henry R
Bi, Jinbo
Refining multivariate disease phenotypes for high chip heritability
title Refining multivariate disease phenotypes for high chip heritability
title_full Refining multivariate disease phenotypes for high chip heritability
title_fullStr Refining multivariate disease phenotypes for high chip heritability
title_full_unstemmed Refining multivariate disease phenotypes for high chip heritability
title_short Refining multivariate disease phenotypes for high chip heritability
title_sort refining multivariate disease phenotypes for high chip heritability
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4582350/
https://www.ncbi.nlm.nih.gov/pubmed/26399736
http://dx.doi.org/10.1186/1755-8794-8-S3-S3
work_keys_str_mv AT sunjiangwen refiningmultivariatediseasephenotypesforhighchipheritability
AT kranzlerhenryr refiningmultivariatediseasephenotypesforhighchipheritability
AT bijinbo refiningmultivariatediseasephenotypesforhighchipheritability