Cargando…

Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network

MOTIVATION: Genome-wide association studies is an essential tool for analyzing associations between phenotypes and single nucleotide polymorphisms (SNPs). Most of binary phenotypes in large biobanks are extremely unbalanced, which leads to inflated type I error rates for many widely used association...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Hongjing, Cao, Xuewei, Zhang, Shuanglin, Sha, Qiuying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697735/
https://www.ncbi.nlm.nih.gov/pubmed/37991852
http://dx.doi.org/10.1093/bioinformatics/btad707
_version_ 1785154802831851520
author Xie, Hongjing
Cao, Xuewei
Zhang, Shuanglin
Sha, Qiuying
author_facet Xie, Hongjing
Cao, Xuewei
Zhang, Shuanglin
Sha, Qiuying
author_sort Xie, Hongjing
collection PubMed
description MOTIVATION: Genome-wide association studies is an essential tool for analyzing associations between phenotypes and single nucleotide polymorphisms (SNPs). Most of binary phenotypes in large biobanks are extremely unbalanced, which leads to inflated type I error rates for many widely used association tests for joint analysis of multiple phenotypes. In this article, we first propose a novel method to construct a Multi-Layer Network (MLN) using individuals with at least one case status among all phenotypes. Then, we introduce a computationally efficient community detection method to group phenotypes into disjoint clusters based on the MLN. Finally, we propose a novel approach, MLN with Omnibus (MLN-O), to jointly analyse the association between phenotypes and a SNP. MLN-O uses the score test to test the association of each merged phenotype in a cluster and a SNP, then uses the Omnibus test to obtain an overall test statistic to test the association between all phenotypes and a SNP. RESULTS: We conduct extensive simulation studies to reveal that the proposed approach can control type I error rates and is more powerful than some existing methods. Meanwhile, we apply the proposed method to a real data set in the UK Biobank. Using phenotypes in Chapter XIII (Diseases of the musculoskeletal system and connective tissue) in the UK Biobank, we find that MLN-O identifies more significant SNPs than other methods we compare with. AVAILABILITY AND IMPLEMENTATION: https://github.com/Hongjing-Xie/Multi-Layer-Network-with-Omnibus-MLN-O.
format Online
Article
Text
id pubmed-10697735
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106977352023-12-06 Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network Xie, Hongjing Cao, Xuewei Zhang, Shuanglin Sha, Qiuying Bioinformatics Original Paper MOTIVATION: Genome-wide association studies is an essential tool for analyzing associations between phenotypes and single nucleotide polymorphisms (SNPs). Most of binary phenotypes in large biobanks are extremely unbalanced, which leads to inflated type I error rates for many widely used association tests for joint analysis of multiple phenotypes. In this article, we first propose a novel method to construct a Multi-Layer Network (MLN) using individuals with at least one case status among all phenotypes. Then, we introduce a computationally efficient community detection method to group phenotypes into disjoint clusters based on the MLN. Finally, we propose a novel approach, MLN with Omnibus (MLN-O), to jointly analyse the association between phenotypes and a SNP. MLN-O uses the score test to test the association of each merged phenotype in a cluster and a SNP, then uses the Omnibus test to obtain an overall test statistic to test the association between all phenotypes and a SNP. RESULTS: We conduct extensive simulation studies to reveal that the proposed approach can control type I error rates and is more powerful than some existing methods. Meanwhile, we apply the proposed method to a real data set in the UK Biobank. Using phenotypes in Chapter XIII (Diseases of the musculoskeletal system and connective tissue) in the UK Biobank, we find that MLN-O identifies more significant SNPs than other methods we compare with. AVAILABILITY AND IMPLEMENTATION: https://github.com/Hongjing-Xie/Multi-Layer-Network-with-Omnibus-MLN-O. Oxford University Press 2023-11-22 /pmc/articles/PMC10697735/ /pubmed/37991852 http://dx.doi.org/10.1093/bioinformatics/btad707 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Xie, Hongjing
Cao, Xuewei
Zhang, Shuanglin
Sha, Qiuying
Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network
title Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network
title_full Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network
title_fullStr Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network
title_full_unstemmed Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network
title_short Joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network
title_sort joint analysis of multiple phenotypes for extremely unbalanced case–control association studies using multi-layer network
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697735/
https://www.ncbi.nlm.nih.gov/pubmed/37991852
http://dx.doi.org/10.1093/bioinformatics/btad707
work_keys_str_mv AT xiehongjing jointanalysisofmultiplephenotypesforextremelyunbalancedcasecontrolassociationstudiesusingmultilayernetwork
AT caoxuewei jointanalysisofmultiplephenotypesforextremelyunbalancedcasecontrolassociationstudiesusingmultilayernetwork
AT zhangshuanglin jointanalysisofmultiplephenotypesforextremelyunbalancedcasecontrolassociationstudiesusingmultilayernetwork
AT shaqiuying jointanalysisofmultiplephenotypesforextremelyunbalancedcasecontrolassociationstudiesusingmultilayernetwork