Cargando…
A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping
Motivation: Identifying new genetic associations in non-Mendelian complex diseases is an increasingly difficult challenge. These diseases sometimes appear to have a significant component of heritability requiring explanation, and this missing heritability may be due to the existence of subtypes invo...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9207464/ https://www.ncbi.nlm.nih.gov/pubmed/35734430 http://dx.doi.org/10.3389/fgene.2022.859462 |
_version_ | 1784729537773305856 |
---|---|
author | Courbariaux, Marie De Santiago, Kylliann Dalmasso, Cyril Danjou, Fabrice Bekadar, Samir Corvol, Jean-Christophe Martinez, Maria Szafranski, Marie Ambroise, Christophe |
author_facet | Courbariaux, Marie De Santiago, Kylliann Dalmasso, Cyril Danjou, Fabrice Bekadar, Samir Corvol, Jean-Christophe Martinez, Maria Szafranski, Marie Ambroise, Christophe |
author_sort | Courbariaux, Marie |
collection | PubMed |
description | Motivation: Identifying new genetic associations in non-Mendelian complex diseases is an increasingly difficult challenge. These diseases sometimes appear to have a significant component of heritability requiring explanation, and this missing heritability may be due to the existence of subtypes involving different genetic factors. Taking genetic information into account in clinical trials might potentially have a role in guiding the process of subtyping a complex disease. Most methods dealing with multiple sources of information rely on data transformation, and in disease subtyping, the two main strategies used are 1) the clustering of clinical data followed by posterior genetic analysis and 2) the concomitant clustering of clinical and genetic variables. Both of these strategies have limitations that we propose to address. Contribution: This work proposes an original method for disease subtyping on the basis of both longitudinal clinical variables and high-dimensional genetic markers via a sparse mixture-of-regressions model. The added value of our approach lies in its interpretability in relation to two aspects. First, our model links both clinical and genetic data with regard to their initial nature (i.e., without transformation) and does not require post-processing where the original information is accessed a second time to interpret the subtypes. Second, it can address large-scale problems because of a variable selection step that is used to discard genetic variables that may not be relevant for subtyping. Results: The proposed method was validated on simulations. A dataset from a cohort of Parkinson’s disease patients was also analyzed. Several subtypes of the disease and genetic variants that potentially have a role in this typology were identified. Software availability: The R code for the proposed method, named DiSuGen, and a tutorial are available for download (see the references). |
format | Online Article Text |
id | pubmed-9207464 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92074642022-06-21 A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping Courbariaux, Marie De Santiago, Kylliann Dalmasso, Cyril Danjou, Fabrice Bekadar, Samir Corvol, Jean-Christophe Martinez, Maria Szafranski, Marie Ambroise, Christophe Front Genet Genetics Motivation: Identifying new genetic associations in non-Mendelian complex diseases is an increasingly difficult challenge. These diseases sometimes appear to have a significant component of heritability requiring explanation, and this missing heritability may be due to the existence of subtypes involving different genetic factors. Taking genetic information into account in clinical trials might potentially have a role in guiding the process of subtyping a complex disease. Most methods dealing with multiple sources of information rely on data transformation, and in disease subtyping, the two main strategies used are 1) the clustering of clinical data followed by posterior genetic analysis and 2) the concomitant clustering of clinical and genetic variables. Both of these strategies have limitations that we propose to address. Contribution: This work proposes an original method for disease subtyping on the basis of both longitudinal clinical variables and high-dimensional genetic markers via a sparse mixture-of-regressions model. The added value of our approach lies in its interpretability in relation to two aspects. First, our model links both clinical and genetic data with regard to their initial nature (i.e., without transformation) and does not require post-processing where the original information is accessed a second time to interpret the subtypes. Second, it can address large-scale problems because of a variable selection step that is used to discard genetic variables that may not be relevant for subtyping. Results: The proposed method was validated on simulations. A dataset from a cohort of Parkinson’s disease patients was also analyzed. Several subtypes of the disease and genetic variants that potentially have a role in this typology were identified. Software availability: The R code for the proposed method, named DiSuGen, and a tutorial are available for download (see the references). Frontiers Media S.A. 2022-06-06 /pmc/articles/PMC9207464/ /pubmed/35734430 http://dx.doi.org/10.3389/fgene.2022.859462 Text en Copyright © 2022 Courbariaux, De Santiago, Dalmasso, Danjou, Bekadar, Corvol, Martinez, Szafranski and Ambroise. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Courbariaux, Marie De Santiago, Kylliann Dalmasso, Cyril Danjou, Fabrice Bekadar, Samir Corvol, Jean-Christophe Martinez, Maria Szafranski, Marie Ambroise, Christophe A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping |
title | A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping |
title_full | A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping |
title_fullStr | A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping |
title_full_unstemmed | A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping |
title_short | A Sparse Mixture-of-Experts Model With Screening of Genetic Associations to Guide Disease Subtyping |
title_sort | sparse mixture-of-experts model with screening of genetic associations to guide disease subtyping |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9207464/ https://www.ncbi.nlm.nih.gov/pubmed/35734430 http://dx.doi.org/10.3389/fgene.2022.859462 |
work_keys_str_mv | AT courbariauxmarie asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT desantiagokylliann asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT dalmassocyril asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT danjoufabrice asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT bekadarsamir asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT corvoljeanchristophe asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT martinezmaria asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT szafranskimarie asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT ambroisechristophe asparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT courbariauxmarie sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT desantiagokylliann sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT dalmassocyril sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT danjoufabrice sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT bekadarsamir sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT corvoljeanchristophe sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT martinezmaria sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT szafranskimarie sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping AT ambroisechristophe sparsemixtureofexpertsmodelwithscreeningofgeneticassociationstoguidediseasesubtyping |