Cargando…

Large sample size and nonlinear sparse models outline epistatic effects in inflammatory bowel disease

BACKGROUND: Despite clear evidence of nonlinear interactions in the molecular architecture of polygenic diseases, linear models have so far appeared optimal in genotype-to-phenotype modeling. A key bottleneck for such modeling is that genetic data intrinsically suffers from underdetermination ([Form...

Descripción completa

Detalles Bibliográficos
Autores principales: Verplaetse, Nora, Passemiers, Antoine, Arany, Adam, Moreau, Yves, Raimondi, Daniele
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10552306/
https://www.ncbi.nlm.nih.gov/pubmed/37798735
http://dx.doi.org/10.1186/s13059-023-03064-y
Descripción
Sumario:BACKGROUND: Despite clear evidence of nonlinear interactions in the molecular architecture of polygenic diseases, linear models have so far appeared optimal in genotype-to-phenotype modeling. A key bottleneck for such modeling is that genetic data intrinsically suffers from underdetermination ([Formula: see text] ). Millions of variants are present in each individual while the collection of large, homogeneous cohorts is hindered by phenotype incidence, sequencing cost, and batch effects. RESULTS: We demonstrate that when we provide enough training data and control the complexity of nonlinear models, a neural network outperforms additive approaches in whole exome sequencing-based inflammatory bowel disease case–control prediction. To do so, we propose a biologically meaningful sparsified neural network architecture, providing empirical evidence for positive and negative epistatic effects present in the inflammatory bowel disease pathogenesis. CONCLUSIONS: In this paper, we show that underdetermination is likely a major driver for the apparent optimality of additive modeling in clinical genetics today. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03064-y.