Cargando…

Hierarchical structural component model for pathway analysis of common variants

BACKGROUND: Genome-wide association studies (GWAS) have been widely used to identify phenotype-related genetic variants using many statistical methods, such as logistic and linear regression. However, GWAS-identified SNPs, as identified with stringent statistical significance, explain just a small p...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Nan, Lee, Sungyoung, Park, Taesung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7038534/
https://www.ncbi.nlm.nih.gov/pubmed/32093692
http://dx.doi.org/10.1186/s12920-019-0650-0
_version_ 1783500662605086720
author Jiang, Nan
Lee, Sungyoung
Park, Taesung
author_facet Jiang, Nan
Lee, Sungyoung
Park, Taesung
author_sort Jiang, Nan
collection PubMed
description BACKGROUND: Genome-wide association studies (GWAS) have been widely used to identify phenotype-related genetic variants using many statistical methods, such as logistic and linear regression. However, GWAS-identified SNPs, as identified with stringent statistical significance, explain just a small portion of the overall estimated genetic heritability. To address this ‘missing heritability’ issue, gene- and pathway-based analysis, and biological mechanisms, have been used for many GWAS studies. However, many of these methods often neglect the correlation between genes and between pathways. METHODS: We constructed a hierarchical component model that considers correlations both between genes and between pathways. Based on this model, we propose a novel pathway analysis method for GWAS datasets, Hierarchical structural Component Model for Pathway analysis of Common vAriants (HisCoM-PCA). HisCoM-PCA first summarizes the common variants of each gene, first at the gene-level, and then analyzes all pathways simultaneously by ridge-type penalization of both the gene and pathway effects on the phenotype. Statistical significance of the gene and pathway coefficients can be examined by permutation tests. RESULTS: Using the simulation data set of Genetic Analysis Workshop 17 (GAW17), for both binary and continuous phenotypes, we showed that HisCoM-PCA well-controlled type I error, and had a higher empirical power compared to several other methods. In addition, we applied our method to a SNP chip dataset of KARE for four human physiologic traits: (1) type 2 diabetes; (2) hypertension; (3) systolic blood pressure; and (4) diastolic blood pressure. Those results showed that HisCoM-PCA could successfully identify signal pathways with superior statistical and biological significance. CONCLUSIONS: Our approach has the advantage of providing an intuitive biological interpretation for associations between common variants and phenotypes, via pathway information, potentially addressing the missing heritability conundrum.
format Online
Article
Text
id pubmed-7038534
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70385342020-03-02 Hierarchical structural component model for pathway analysis of common variants Jiang, Nan Lee, Sungyoung Park, Taesung BMC Med Genomics Methodology BACKGROUND: Genome-wide association studies (GWAS) have been widely used to identify phenotype-related genetic variants using many statistical methods, such as logistic and linear regression. However, GWAS-identified SNPs, as identified with stringent statistical significance, explain just a small portion of the overall estimated genetic heritability. To address this ‘missing heritability’ issue, gene- and pathway-based analysis, and biological mechanisms, have been used for many GWAS studies. However, many of these methods often neglect the correlation between genes and between pathways. METHODS: We constructed a hierarchical component model that considers correlations both between genes and between pathways. Based on this model, we propose a novel pathway analysis method for GWAS datasets, Hierarchical structural Component Model for Pathway analysis of Common vAriants (HisCoM-PCA). HisCoM-PCA first summarizes the common variants of each gene, first at the gene-level, and then analyzes all pathways simultaneously by ridge-type penalization of both the gene and pathway effects on the phenotype. Statistical significance of the gene and pathway coefficients can be examined by permutation tests. RESULTS: Using the simulation data set of Genetic Analysis Workshop 17 (GAW17), for both binary and continuous phenotypes, we showed that HisCoM-PCA well-controlled type I error, and had a higher empirical power compared to several other methods. In addition, we applied our method to a SNP chip dataset of KARE for four human physiologic traits: (1) type 2 diabetes; (2) hypertension; (3) systolic blood pressure; and (4) diastolic blood pressure. Those results showed that HisCoM-PCA could successfully identify signal pathways with superior statistical and biological significance. CONCLUSIONS: Our approach has the advantage of providing an intuitive biological interpretation for associations between common variants and phenotypes, via pathway information, potentially addressing the missing heritability conundrum. BioMed Central 2020-02-24 /pmc/articles/PMC7038534/ /pubmed/32093692 http://dx.doi.org/10.1186/s12920-019-0650-0 Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Jiang, Nan
Lee, Sungyoung
Park, Taesung
Hierarchical structural component model for pathway analysis of common variants
title Hierarchical structural component model for pathway analysis of common variants
title_full Hierarchical structural component model for pathway analysis of common variants
title_fullStr Hierarchical structural component model for pathway analysis of common variants
title_full_unstemmed Hierarchical structural component model for pathway analysis of common variants
title_short Hierarchical structural component model for pathway analysis of common variants
title_sort hierarchical structural component model for pathway analysis of common variants
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7038534/
https://www.ncbi.nlm.nih.gov/pubmed/32093692
http://dx.doi.org/10.1186/s12920-019-0650-0
work_keys_str_mv AT jiangnan hierarchicalstructuralcomponentmodelforpathwayanalysisofcommonvariants
AT leesungyoung hierarchicalstructuralcomponentmodelforpathwayanalysisofcommonvariants
AT parktaesung hierarchicalstructuralcomponentmodelforpathwayanalysisofcommonvariants