Cargando…
HCLC-FC: A novel statistical method for phenome-wide association studies
The emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association studies (PheWAS). In PheWAS, the whole phenome can be divided into numerous phenotypic categories according to the genetic architecture across phenotypes. Curren...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9645610/ https://www.ncbi.nlm.nih.gov/pubmed/36350801 http://dx.doi.org/10.1371/journal.pone.0276646 |
_version_ | 1784826998818865152 |
---|---|
author | Liang, Xiaoyu Cao, Xuewei Sha, Qiuying Zhang, Shuanglin |
author_facet | Liang, Xiaoyu Cao, Xuewei Sha, Qiuying Zhang, Shuanglin |
author_sort | Liang, Xiaoyu |
collection | PubMed |
description | The emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association studies (PheWAS). In PheWAS, the whole phenome can be divided into numerous phenotypic categories according to the genetic architecture across phenotypes. Currently, statistical analyses for PheWAS are mainly univariate analyses, which test the association between one genetic variant and one phenotype at a time. In this article, we derived a novel and powerful multivariate method for PheWAS. The proposed method involves three steps. In the first step, we apply the bottom-up hierarchical clustering method to partition a large number of phenotypes into disjoint clusters within each phenotypic category. In the second step, the clustering linear combination method is used to combine test statistics within each category based on the phenotypic clusters and obtain p-values from each phenotypic category. In the third step, we propose a new false discovery rate (FDR) control approach. We perform extensive simulation studies to compare the performance of our method with that of other existing methods. The results show that our proposed method controls FDR very well and outperforms other methods we compared with. We also apply the proposed approach to a set of EMR-based phenotypes across more than 300,000 samples from the UK Biobank. We find that the proposed approach not only can well-control FDR at a nominal level but also successfully identify 1,244 significant SNPs that are reported to be associated with some phenotypes in the GWAS catalog. Our open-access tools and instructions on how to implement HCLC-FC are available at https://github.com/XiaoyuLiang/HCLCFC. |
format | Online Article Text |
id | pubmed-9645610 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-96456102022-11-15 HCLC-FC: A novel statistical method for phenome-wide association studies Liang, Xiaoyu Cao, Xuewei Sha, Qiuying Zhang, Shuanglin PLoS One Research Article The emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association studies (PheWAS). In PheWAS, the whole phenome can be divided into numerous phenotypic categories according to the genetic architecture across phenotypes. Currently, statistical analyses for PheWAS are mainly univariate analyses, which test the association between one genetic variant and one phenotype at a time. In this article, we derived a novel and powerful multivariate method for PheWAS. The proposed method involves three steps. In the first step, we apply the bottom-up hierarchical clustering method to partition a large number of phenotypes into disjoint clusters within each phenotypic category. In the second step, the clustering linear combination method is used to combine test statistics within each category based on the phenotypic clusters and obtain p-values from each phenotypic category. In the third step, we propose a new false discovery rate (FDR) control approach. We perform extensive simulation studies to compare the performance of our method with that of other existing methods. The results show that our proposed method controls FDR very well and outperforms other methods we compared with. We also apply the proposed approach to a set of EMR-based phenotypes across more than 300,000 samples from the UK Biobank. We find that the proposed approach not only can well-control FDR at a nominal level but also successfully identify 1,244 significant SNPs that are reported to be associated with some phenotypes in the GWAS catalog. Our open-access tools and instructions on how to implement HCLC-FC are available at https://github.com/XiaoyuLiang/HCLCFC. Public Library of Science 2022-11-09 /pmc/articles/PMC9645610/ /pubmed/36350801 http://dx.doi.org/10.1371/journal.pone.0276646 Text en © 2022 Liang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Liang, Xiaoyu Cao, Xuewei Sha, Qiuying Zhang, Shuanglin HCLC-FC: A novel statistical method for phenome-wide association studies |
title | HCLC-FC: A novel statistical method for phenome-wide association studies |
title_full | HCLC-FC: A novel statistical method for phenome-wide association studies |
title_fullStr | HCLC-FC: A novel statistical method for phenome-wide association studies |
title_full_unstemmed | HCLC-FC: A novel statistical method for phenome-wide association studies |
title_short | HCLC-FC: A novel statistical method for phenome-wide association studies |
title_sort | hclc-fc: a novel statistical method for phenome-wide association studies |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9645610/ https://www.ncbi.nlm.nih.gov/pubmed/36350801 http://dx.doi.org/10.1371/journal.pone.0276646 |
work_keys_str_mv | AT liangxiaoyu hclcfcanovelstatisticalmethodforphenomewideassociationstudies AT caoxuewei hclcfcanovelstatisticalmethodforphenomewideassociationstudies AT shaqiuying hclcfcanovelstatisticalmethodforphenomewideassociationstudies AT zhangshuanglin hclcfcanovelstatisticalmethodforphenomewideassociationstudies |