Integrating multiomics and prior knowledge: a study of the Graphnet penalty impact
MOTIVATION: In the field of oncology, statistical models are used for the discovery of candidate factors that influence the development of the pathology or its outcome. These statistical models can be designed in a multiblock framework to study the relationship between different multiomic data, and...
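Main authors: | Chegraoui, Hamza; Guillemot, Vincent; Rebei, Amine; Gloaguen, Arnaud; Grill, Jacques; Philippe, Cathy; Frouin, Vincent |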
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2023 |
Subjects: | Original Paper |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10403429/ https://www.ncbi.nlm.nih.gov/pubmed/37490467 http://dx.doi.org/10.1093/bioinformatics/btad454 |
_version_ | 1785085066837229568 |
---|---|
author | Chegraoui, Hamza Guillemot, Vincent Rebei, Amine Gloaguen, Arnaud Grill, Jacques Philippe, Cathy Frouin, Vincent |
author_facet | Chegraoui, Hamza Guillemot, Vincent Rebei, Amine Gloaguen, Arnaud Grill, Jacques Philippe, Cathy Frouin, Vincent |
author_sort | Chegraoui, Hamza |
collection | PubMed |
description | MOTIVATION: In the field of oncology, statistical models are used for the discovery of candidate factors that influence the development of the pathology or its outcome. These statistical models can be designed in a multiblock framework to study the relationship between different multiomic data, and variable selection is often achieved by imposing constraints on the model parameters. A priori graph constraints have been used in the literature as a way to improve feature selection in the model, yielding more interpretability. However, it is still unclear how these graphs interact with the models and how they impact the feature selection. Additionally, with the availability of different graphs encoding different information, one can wonder how the choice of the graph meaningfully impacts the results obtained. RESULTS: We proposed to study the graph penalty impact on a multiblock model. Specifically, we used the SGCCA as the multiblock framework. We studied the effect of the penalty on the model using the TCGA-LGG dataset. Our findings are 3-fold. We showed that the graph penalty increases the number of selected genes from this dataset, while selecting genes already identified in other works as pertinent biomarkers in the pathology. We demonstrated that using different graphs leads to different though consistent results, but that graph density is the main factor influencing the obtained results. Finally, we showed that the graph penalty increases the performance of the survival prediction from the model-derived components and the interpretability of the results. AVAILABILITY AND IMPLEMENTATION: Source code is freely available at https://github.com/neurospin/netSGCCA |
format | Online Article Text |
id | pubmed-10403429 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-10403429 2023-08-06 Integrating multiomics and prior knowledge: a study of the Graphnet penalty impact Chegraoui, Hamza Guillemot, Vincent Rebei, Amine Gloaguen, Arnaud Grill, Jacques Philippe, Cathy Frouin, Vincent Bioinformatics Original Paper MOTIVATION: In the field of oncology, statistical models are used for the discovery of candidate factors that influence the development of the pathology or its outcome. These statistical models can be designed in a multiblock framework to study the relationship between different multiomic data, and variable selection is often achieved by imposing constraints on the model parameters. A priori graph constraints have been used in the literature as a way to improve feature selection in the model, yielding more interpretability. However, it is still unclear how these graphs interact with the models and how they impact the feature selection. Additionally, with the availability of different graphs encoding different information, one can wonder how the choice of the graph meaningfully impacts the results obtained. RESULTS: We proposed to study the graph penalty impact on a multiblock model. Specifically, we used the SGCCA as the multiblock framework. We studied the effect of the penalty on the model using the TCGA-LGG dataset. Our findings are 3-fold. We showed that the graph penalty increases the number of selected genes from this dataset, while selecting genes already identified in other works as pertinent biomarkers in the pathology. We demonstrated that using different graphs leads to different though consistent results, but that graph density is the main factor influencing the obtained results. Finally, we showed that the graph penalty increases the performance of the survival prediction from the model-derived components and the interpretability of the results. AVAILABILITY AND IMPLEMENTATION: Source code is freely available at https://github.com/neurospin/netSGCCA Oxford University Press 2023-07-25 /pmc/articles/PMC10403429/ /pubmed/37490467 http://dx.doi.org/10.1093/bioinformatics/btad454 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Chegraoui, Hamza Guillemot, Vincent Rebei, Amine Gloaguen, Arnaud Grill, Jacques Philippe, Cathy Frouin, Vincent Integrating multiomics and prior knowledge: a study of the Graphnet penalty impact |
title | Integrating multiomics and prior knowledge: a study of the Graphnet penalty impact |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10403429/ https://www.ncbi.nlm.nih.gov/pubmed/37490467 http://dx.doi.org/10.1093/bioinformatics/btad454 |