Cargando…

Interpretable network-guided epistasis detection

BACKGROUND: Detecting epistatic interactions at the gene level is essential to understanding the biological mechanisms of complex diseases. Unfortunately, genome-wide interaction association studies involve many statistical challenges that make such detection hard. We propose a multi-step protocol f...

Descripción completa

Detalles Bibliográficos
Autores principales: Duroux, Diane, Climente-González, Héctor, Azencott, Chloé-Agathe, Van Steen, Kristel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8848319/
https://www.ncbi.nlm.nih.gov/pubmed/35134928
http://dx.doi.org/10.1093/gigascience/giab093
_version_ 1784652224082739200
author Duroux, Diane
Climente-González, Héctor
Azencott, Chloé-Agathe
Van Steen, Kristel
author_facet Duroux, Diane
Climente-González, Héctor
Azencott, Chloé-Agathe
Van Steen, Kristel
author_sort Duroux, Diane
collection PubMed
description BACKGROUND: Detecting epistatic interactions at the gene level is essential to understanding the biological mechanisms of complex diseases. Unfortunately, genome-wide interaction association studies involve many statistical challenges that make such detection hard. We propose a multi-step protocol for epistasis detection along the edges of a gene-gene co-function network. Such an approach reduces the number of tests performed and provides interpretable interactions while keeping type I error controlled. Yet, mapping gene interactions into testable single-nucleotide polymorphism (SNP)-interaction hypotheses, as well as computing gene pair association scores from SNP pair ones, is not trivial. RESULTS: Here we compare 3 SNP-gene mappings (positional overlap, expression quantitative trait loci, and proximity in 3D structure) and use the adaptive truncated product method to compute gene pair scores. This method is non-parametric, does not require a known null distribution, and is fast to compute. We apply multiple variants of this protocol to a genome-wide association study dataset on inflammatory bowel disease. Different configurations produced different results, highlighting that various mechanisms are implicated in inflammatory bowel disease, while at the same time, results overlapped with known disease characteristics. Importantly, the proposed pipeline also differs from a conventional approach where no network is used, showing the potential for additional discoveries when prior biological knowledge is incorporated into epistasis detection.
format Online
Article
Text
id pubmed-8848319
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-88483192022-02-17 Interpretable network-guided epistasis detection Duroux, Diane Climente-González, Héctor Azencott, Chloé-Agathe Van Steen, Kristel Gigascience Research BACKGROUND: Detecting epistatic interactions at the gene level is essential to understanding the biological mechanisms of complex diseases. Unfortunately, genome-wide interaction association studies involve many statistical challenges that make such detection hard. We propose a multi-step protocol for epistasis detection along the edges of a gene-gene co-function network. Such an approach reduces the number of tests performed and provides interpretable interactions while keeping type I error controlled. Yet, mapping gene interactions into testable single-nucleotide polymorphism (SNP)-interaction hypotheses, as well as computing gene pair association scores from SNP pair ones, is not trivial. RESULTS: Here we compare 3 SNP-gene mappings (positional overlap, expression quantitative trait loci, and proximity in 3D structure) and use the adaptive truncated product method to compute gene pair scores. This method is non-parametric, does not require a known null distribution, and is fast to compute. We apply multiple variants of this protocol to a genome-wide association study dataset on inflammatory bowel disease. Different configurations produced different results, highlighting that various mechanisms are implicated in inflammatory bowel disease, while at the same time, results overlapped with known disease characteristics. Importantly, the proposed pipeline also differs from a conventional approach where no network is used, showing the potential for additional discoveries when prior biological knowledge is incorporated into epistasis detection. Oxford University Press 2022-02-04 /pmc/articles/PMC8848319/ /pubmed/35134928 http://dx.doi.org/10.1093/gigascience/giab093 Text en © The Author(s) 2022. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Duroux, Diane
Climente-González, Héctor
Azencott, Chloé-Agathe
Van Steen, Kristel
Interpretable network-guided epistasis detection
title Interpretable network-guided epistasis detection
title_full Interpretable network-guided epistasis detection
title_fullStr Interpretable network-guided epistasis detection
title_full_unstemmed Interpretable network-guided epistasis detection
title_short Interpretable network-guided epistasis detection
title_sort interpretable network-guided epistasis detection
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8848319/
https://www.ncbi.nlm.nih.gov/pubmed/35134928
http://dx.doi.org/10.1093/gigascience/giab093
work_keys_str_mv AT durouxdiane interpretablenetworkguidedepistasisdetection
AT climentegonzalezhector interpretablenetworkguidedepistasisdetection
AT azencottchloeagathe interpretablenetworkguidedepistasisdetection
AT vansteenkristel interpretablenetworkguidedepistasisdetection