Cargando…

A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping

Motivation: Identifying new genetic associations in non-Mendelian complex diseases is an increasingly difficult challenge. These diseases sometimes appear to have a significant component of heritability requiring explanation, and this missing heritability may be due to the existence of subtypes invo...

Descripción completa

Detalles Bibliográficos
Autores principales: Courbariaux, Marie, De Santiago, Kylliann, Dalmasso, Cyril, Danjou, Fabrice, Bekadar, Samir, Corvol, Jean-Christophe, Martinez, Maria, Szafranski, Marie, Ambroise, Christophe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9207464/
https://www.ncbi.nlm.nih.gov/pubmed/35734430
http://dx.doi.org/10.3389/fgene.2022.859462
_version_ 1784729537773305856
author Courbariaux, Marie
De Santiago, Kylliann
Dalmasso, Cyril
Danjou, Fabrice
Bekadar, Samir
Corvol, Jean-Christophe
Martinez, Maria
Szafranski, Marie
Ambroise, Christophe
author_facet Courbariaux, Marie
De Santiago, Kylliann
Dalmasso, Cyril
Danjou, Fabrice
Bekadar, Samir
Corvol, Jean-Christophe
Martinez, Maria
Szafranski, Marie
Ambroise, Christophe
author_sort Courbariaux, Marie
collection PubMed
description Motivation: Identifying new genetic associations in non-Mendelian complex diseases is an increasingly difficult challenge. These diseases sometimes appear to have a significant component of heritability requiring explanation, and this missing heritability may be due to the existence of subtypes involving different genetic factors. Taking genetic information into account in clinical trials might potentially have a role in guiding the process of subtyping a complex disease. Most methods dealing with multiple sources of information rely on data transformation, and in disease subtyping, the two main strategies used are 1) the clustering of clinical data followed by posterior genetic analysis and 2) the concomitant clustering of clinical and genetic variables. Both of these strategies have limitations that we propose to address. Contribution: This work proposes an original method for disease subtyping on the basis of both longitudinal clinical variables and high-dimensional genetic markers via a sparse mixture-of-regressions model. The added value of our approach lies in its interpretability in relation to two aspects. First, our model links both clinical and genetic data with regard to their initial nature (i.e., without transformation) and does not require post-processing where the original information is accessed a second time to interpret the subtypes. Second, it can address large-scale problems because of a variable selection step that is used to discard genetic variables that may not be relevant for subtyping. Results: The proposed method was validated on simulations. A dataset from a cohort of Parkinson’s disease patients was also analyzed. Several subtypes of the disease and genetic variants that potentially have a role in this typology were identified. Software availability: The R code for the proposed method, named DiSuGen, and a tutorial are available for download (see the references).
format Online
Article
Text
id pubmed-9207464
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92074642022-06-21 A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping Courbariaux, Marie De Santiago, Kylliann Dalmasso, Cyril Danjou, Fabrice Bekadar, Samir Corvol, Jean-Christophe Martinez, Maria Szafranski, Marie Ambroise, Christophe Front Genet Genetics Motivation: Identifying new genetic associations in non-Mendelian complex diseases is an increasingly difficult challenge. These diseases sometimes appear to have a significant component of heritability requiring explanation, and this missing heritability may be due to the existence of subtypes involving different genetic factors. Taking genetic information into account in clinical trials might potentially have a role in guiding the process of subtyping a complex disease. Most methods dealing with multiple sources of information rely on data transformation, and in disease subtyping, the two main strategies used are 1) the clustering of clinical data followed by posterior genetic analysis and 2) the concomitant clustering of clinical and genetic variables. Both of these strategies have limitations that we propose to address. Contribution: This work proposes an original method for disease subtyping on the basis of both longitudinal clinical variables and high-dimensional genetic markers via a sparse mixture-of-regressions model. The added value of our approach lies in its interpretability in relation to two aspects. First, our model links both clinical and genetic data with regard to their initial nature (i.e., without transformation) and does not require post-processing where the original information is accessed a second time to interpret the subtypes. Second, it can address large-scale problems because of a variable selection step that is used to discard genetic variables that may not be relevant for subtyping. Results: The proposed method was validated on simulations. A dataset from a cohort of Parkinson’s disease patients was also analyzed. Several subtypes of the disease and genetic variants that potentially have a role in this typology were identified. Software availability: The R code for the proposed method, named DiSuGen, and a tutorial are available for download (see the references). Frontiers Media S.A. 2022-06-06 /pmc/articles/PMC9207464/ /pubmed/35734430 http://dx.doi.org/10.3389/fgene.2022.859462 Text en Copyright © 2022 Courbariaux, De Santiago, Dalmasso, Danjou, Bekadar, Corvol, Martinez, Szafranski and Ambroise. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Courbariaux, Marie
De Santiago, Kylliann
Dalmasso, Cyril
Danjou, Fabrice
Bekadar, Samir
Corvol, Jean-Christophe
Martinez, Maria
Szafranski, Marie
Ambroise, Christophe
A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping
title A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping
title_full A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping
title_fullStr A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping
title_full_unstemmed A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping
title_short A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping
title_sort sparse mixture-of-experts model with screening of genetic associations to guide disease subtyping
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9207464/
https://www.ncbi.nlm.nih.gov/pubmed/35734430
http://dx.doi.org/10.3389/fgene.2022.859462
work_keys_str_mv AT courbariauxmarie asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT desantiagokylliann asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT dalmassocyril asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT danjoufabrice asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT bekadarsamir asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT corvoljeanchristophe asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT martinezmaria asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT szafranskimarie asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT ambroisechristophe asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT courbariauxmarie sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT desantiagokylliann sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT dalmassocyril sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT danjoufabrice sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT bekadarsamir sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT corvoljeanchristophe sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT martinezmaria sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT szafranskimarie sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping
AT ambroisechristophe sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping