Cargando…

Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies

Among the myriad of statistical methods that identify gene–gene interactions in the realm of qualitative genome-wide association studies, gene-based interactions are not only powerful statistically, but also they are interpretable biologically. However, they have limited statistical detection by mak...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Yingjie, Wu, Chenxi, Yuan, Zhian, Wang, Yansu, Liang, Zhen, Wang, Yang, Zhang, Yi, Xu, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716787/
https://www.ncbi.nlm.nih.gov/pubmed/34977040
http://dx.doi.org/10.3389/fcell.2021.801113
_version_ 1784624391182614528
author Guo, Yingjie
Wu, Chenxi
Yuan, Zhian
Wang, Yansu
Liang, Zhen
Wang, Yang
Zhang, Yi
Xu, Lei
author_facet Guo, Yingjie
Wu, Chenxi
Yuan, Zhian
Wang, Yansu
Liang, Zhen
Wang, Yang
Zhang, Yi
Xu, Lei
author_sort Guo, Yingjie
collection PubMed
description Among the myriad of statistical methods that identify gene–gene interactions in the realm of qualitative genome-wide association studies, gene-based interactions are not only powerful statistically, but also they are interpretable biologically. However, they have limited statistical detection by making assumptions on the association between traits and single nucleotide polymorphisms. Thus, a gene-based method (GGInt-XGBoost) originated from XGBoost is proposed in this article. Assuming that log odds ratio of disease traits satisfies the additive relationship if the pair of genes had no interactions, the difference in error between the XGBoost model with and without additive constraint could indicate gene–gene interaction; we then used a permutation-based statistical test to assess this difference and to provide a statistical p-value to represent the significance of the interaction. Experimental results on both simulation and real data showed that our approach had superior performance than previous experiments to detect gene–gene interactions.
format Online
Article
Text
id pubmed-8716787
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-87167872021-12-31 Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies Guo, Yingjie Wu, Chenxi Yuan, Zhian Wang, Yansu Liang, Zhen Wang, Yang Zhang, Yi Xu, Lei Front Cell Dev Biol Cell and Developmental Biology Among the myriad of statistical methods that identify gene–gene interactions in the realm of qualitative genome-wide association studies, gene-based interactions are not only powerful statistically, but also they are interpretable biologically. However, they have limited statistical detection by making assumptions on the association between traits and single nucleotide polymorphisms. Thus, a gene-based method (GGInt-XGBoost) originated from XGBoost is proposed in this article. Assuming that log odds ratio of disease traits satisfies the additive relationship if the pair of genes had no interactions, the difference in error between the XGBoost model with and without additive constraint could indicate gene–gene interaction; we then used a permutation-based statistical test to assess this difference and to provide a statistical p-value to represent the significance of the interaction. Experimental results on both simulation and real data showed that our approach had superior performance than previous experiments to detect gene–gene interactions. Frontiers Media S.A. 2021-12-16 /pmc/articles/PMC8716787/ /pubmed/34977040 http://dx.doi.org/10.3389/fcell.2021.801113 Text en Copyright © 2021 Guo, Wu, Yuan, Wang, Liang, Wang, Zhang and Xu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Guo, Yingjie
Wu, Chenxi
Yuan, Zhian
Wang, Yansu
Liang, Zhen
Wang, Yang
Zhang, Yi
Xu, Lei
Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies
title Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies
title_full Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies
title_fullStr Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies
title_full_unstemmed Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies
title_short Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies
title_sort gene-based testing of interactions using xgboost in genome-wide association studies
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716787/
https://www.ncbi.nlm.nih.gov/pubmed/34977040
http://dx.doi.org/10.3389/fcell.2021.801113
work_keys_str_mv AT guoyingjie genebasedtestingofinteractionsusingxgboostingenomewideassociationstudies
AT wuchenxi genebasedtestingofinteractionsusingxgboostingenomewideassociationstudies
AT yuanzhian genebasedtestingofinteractionsusingxgboostingenomewideassociationstudies
AT wangyansu genebasedtestingofinteractionsusingxgboostingenomewideassociationstudies
AT liangzhen genebasedtestingofinteractionsusingxgboostingenomewideassociationstudies
AT wangyang genebasedtestingofinteractionsusingxgboostingenomewideassociationstudies
AT zhangyi genebasedtestingofinteractionsusingxgboostingenomewideassociationstudies
AT xulei genebasedtestingofinteractionsusingxgboostingenomewideassociationstudies