Cargando…
Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network
MOTIVATION: Genome-wide association studies is an essential tool for analyzing associations between phenotypes and single nucleotide polymorphisms (SNPs). Most of binary phenotypes in large biobanks are extremely unbalanced, which leads to inflated type I error rates for many widely used association...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697735/ https://www.ncbi.nlm.nih.gov/pubmed/37991852 http://dx.doi.org/10.1093/bioinformatics/btad707 |
_version_ | 1785154802831851520 |
---|---|
author | Xie, Hongjing Cao, Xuewei Zhang, Shuanglin Sha, Qiuying |
author_facet | Xie, Hongjing Cao, Xuewei Zhang, Shuanglin Sha, Qiuying |
author_sort | Xie, Hongjing |
collection | PubMed |
description | MOTIVATION: Genome-wide association studies is an essential tool for analyzing associations between phenotypes and single nucleotide polymorphisms (SNPs). Most of binary phenotypes in large biobanks are extremely unbalanced, which leads to inflated type I error rates for many widely used association tests for joint analysis of multiple phenotypes. In this article, we first propose a novel method to construct a Multi-Layer Network (MLN) using individuals with at least one case status among all phenotypes. Then, we introduce a computationally efficient community detection method to group phenotypes into disjoint clusters based on the MLN. Finally, we propose a novel approach, MLN with Omnibus (MLN-O), to jointly analyse the association between phenotypes and a SNP. MLN-O uses the score test to test the association of each merged phenotype in a cluster and a SNP, then uses the Omnibus test to obtain an overall test statistic to test the association between all phenotypes and a SNP. RESULTS: We conduct extensive simulation studies to reveal that the proposed approach can control type I error rates and is more powerful than some existing methods. Meanwhile, we apply the proposed method to a real data set in the UK Biobank. Using phenotypes in Chapter XIII (Diseases of the musculoskeletal system and connective tissue) in the UK Biobank, we find that MLN-O identifies more significant SNPs than other methods we compare with. AVAILABILITY AND IMPLEMENTATION: https://github.com/Hongjing-Xie/Multi-Layer-Network-with-Omnibus-MLN-O. |
format | Online Article Text |
id | pubmed-10697735 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106977352023-12-06 Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network Xie, Hongjing Cao, Xuewei Zhang, Shuanglin Sha, Qiuying Bioinformatics Original Paper MOTIVATION: Genome-wide association studies is an essential tool for analyzing associations between phenotypes and single nucleotide polymorphisms (SNPs). Most of binary phenotypes in large biobanks are extremely unbalanced, which leads to inflated type I error rates for many widely used association tests for joint analysis of multiple phenotypes. In this article, we first propose a novel method to construct a Multi-Layer Network (MLN) using individuals with at least one case status among all phenotypes. Then, we introduce a computationally efficient community detection method to group phenotypes into disjoint clusters based on the MLN. Finally, we propose a novel approach, MLN with Omnibus (MLN-O), to jointly analyse the association between phenotypes and a SNP. MLN-O uses the score test to test the association of each merged phenotype in a cluster and a SNP, then uses the Omnibus test to obtain an overall test statistic to test the association between all phenotypes and a SNP. RESULTS: We conduct extensive simulation studies to reveal that the proposed approach can control type I error rates and is more powerful than some existing methods. Meanwhile, we apply the proposed method to a real data set in the UK Biobank. Using phenotypes in Chapter XIII (Diseases of the musculoskeletal system and connective tissue) in the UK Biobank, we find that MLN-O identifies more significant SNPs than other methods we compare with. AVAILABILITY AND IMPLEMENTATION: https://github.com/Hongjing-Xie/Multi-Layer-Network-with-Omnibus-MLN-O. Oxford University Press 2023-11-22 /pmc/articles/PMC10697735/ /pubmed/37991852 http://dx.doi.org/10.1093/bioinformatics/btad707 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Xie, Hongjing Cao, Xuewei Zhang, Shuanglin Sha, Qiuying Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network |
title | Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network |
title_full | Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network |
title_fullStr | Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network |
title_full_unstemmed | Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network |
title_short | Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network |
title_sort | joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697735/ https://www.ncbi.nlm.nih.gov/pubmed/37991852 http://dx.doi.org/10.1093/bioinformatics/btad707 |
work_keys_str_mv | AT xiehongjing jointanalysisofmultiplephenotypesforextremelyunbalancedcasecontrolassociationstudiesusingmultilayernetwork AT caoxuewei jointanalysisofmultiplephenotypesforextremelyunbalancedcasecontrolassociationstudiesusingmultilayernetwork AT zhangshuanglin jointanalysisofmultiplephenotypesforextremelyunbalancedcasecontrolassociationstudiesusingmultilayernetwork AT shaqiuying jointanalysisofmultiplephenotypesforextremelyunbalancedcasecontrolassociationstudiesusingmultilayernetwork |