Cargando…
Hierarchical structural component model for pathway analysis of common variants
BACKGROUND: Genome-wide association studies (GWAS) have been widely used to identify phenotype-related genetic variants using many statistical methods, such as logistic and linear regression. However, GWAS-identified SNPs, as identified with stringent statistical significance, explain just a small p...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7038534/ https://www.ncbi.nlm.nih.gov/pubmed/32093692 http://dx.doi.org/10.1186/s12920-019-0650-0 |
_version_ | 1783500662605086720 |
---|---|
author | Jiang, Nan Lee, Sungyoung Park, Taesung |
author_facet | Jiang, Nan Lee, Sungyoung Park, Taesung |
author_sort | Jiang, Nan |
collection | PubMed |
description | BACKGROUND: Genome-wide association studies (GWAS) have been widely used to identify phenotype-related genetic variants using many statistical methods, such as logistic and linear regression. However, GWAS-identified SNPs, as identified with stringent statistical significance, explain just a small portion of the overall estimated genetic heritability. To address this ‘missing heritability’ issue, gene- and pathway-based analysis, and biological mechanisms, have been used for many GWAS studies. However, many of these methods often neglect the correlation between genes and between pathways. METHODS: We constructed a hierarchical component model that considers correlations both between genes and between pathways. Based on this model, we propose a novel pathway analysis method for GWAS datasets, Hierarchical structural Component Model for Pathway analysis of Common vAriants (HisCoM-PCA). HisCoM-PCA first summarizes the common variants of each gene, first at the gene-level, and then analyzes all pathways simultaneously by ridge-type penalization of both the gene and pathway effects on the phenotype. Statistical significance of the gene and pathway coefficients can be examined by permutation tests. RESULTS: Using the simulation data set of Genetic Analysis Workshop 17 (GAW17), for both binary and continuous phenotypes, we showed that HisCoM-PCA well-controlled type I error, and had a higher empirical power compared to several other methods. In addition, we applied our method to a SNP chip dataset of KARE for four human physiologic traits: (1) type 2 diabetes; (2) hypertension; (3) systolic blood pressure; and (4) diastolic blood pressure. Those results showed that HisCoM-PCA could successfully identify signal pathways with superior statistical and biological significance. CONCLUSIONS: Our approach has the advantage of providing an intuitive biological interpretation for associations between common variants and phenotypes, via pathway information, potentially addressing the missing heritability conundrum. |
format | Online Article Text |
id | pubmed-7038534 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-70385342020-03-02 Hierarchical structural component model for pathway analysis of common variants Jiang, Nan Lee, Sungyoung Park, Taesung BMC Med Genomics Methodology BACKGROUND: Genome-wide association studies (GWAS) have been widely used to identify phenotype-related genetic variants using many statistical methods, such as logistic and linear regression. However, GWAS-identified SNPs, as identified with stringent statistical significance, explain just a small portion of the overall estimated genetic heritability. To address this ‘missing heritability’ issue, gene- and pathway-based analysis, and biological mechanisms, have been used for many GWAS studies. However, many of these methods often neglect the correlation between genes and between pathways. METHODS: We constructed a hierarchical component model that considers correlations both between genes and between pathways. Based on this model, we propose a novel pathway analysis method for GWAS datasets, Hierarchical structural Component Model for Pathway analysis of Common vAriants (HisCoM-PCA). HisCoM-PCA first summarizes the common variants of each gene, first at the gene-level, and then analyzes all pathways simultaneously by ridge-type penalization of both the gene and pathway effects on the phenotype. Statistical significance of the gene and pathway coefficients can be examined by permutation tests. RESULTS: Using the simulation data set of Genetic Analysis Workshop 17 (GAW17), for both binary and continuous phenotypes, we showed that HisCoM-PCA well-controlled type I error, and had a higher empirical power compared to several other methods. In addition, we applied our method to a SNP chip dataset of KARE for four human physiologic traits: (1) type 2 diabetes; (2) hypertension; (3) systolic blood pressure; and (4) diastolic blood pressure. Those results showed that HisCoM-PCA could successfully identify signal pathways with superior statistical and biological significance. CONCLUSIONS: Our approach has the advantage of providing an intuitive biological interpretation for associations between common variants and phenotypes, via pathway information, potentially addressing the missing heritability conundrum. BioMed Central 2020-02-24 /pmc/articles/PMC7038534/ /pubmed/32093692 http://dx.doi.org/10.1186/s12920-019-0650-0 Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Jiang, Nan Lee, Sungyoung Park, Taesung Hierarchical structural component model for pathway analysis of common variants |
title | Hierarchical structural component model for pathway analysis of common variants |
title_full | Hierarchical structural component model for pathway analysis of common variants |
title_fullStr | Hierarchical structural component model for pathway analysis of common variants |
title_full_unstemmed | Hierarchical structural component model for pathway analysis of common variants |
title_short | Hierarchical structural component model for pathway analysis of common variants |
title_sort | hierarchical structural component model for pathway analysis of common variants |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7038534/ https://www.ncbi.nlm.nih.gov/pubmed/32093692 http://dx.doi.org/10.1186/s12920-019-0650-0 |
work_keys_str_mv | AT jiangnan hierarchicalstructuralcomponentmodelforpathwayanalysisofcommonvariants AT leesungyoung hierarchicalstructuralcomponentmodelforpathwayanalysisofcommonvariants AT parktaesung hierarchicalstructuralcomponentmodelforpathwayanalysisofcommonvariants |