Cargando…

Multi-omics disease module detection with an explainable Greedy Decision Forest

Machine learning methods can detect complex relationships between variables, but usually do not exploit domain knowledge. This is a limitation because in many scientific disciplines, such as systems biology, domain knowledge is available in the form of graphs or networks, and its use can improve mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Pfeifer, Bastian, Baniecki, Hubert, Saranti, Anna, Biecek, Przemyslaw, Holzinger, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9546860/
https://www.ncbi.nlm.nih.gov/pubmed/36207536
http://dx.doi.org/10.1038/s41598-022-21417-8
_version_ 1784805138952617984
author Pfeifer, Bastian
Baniecki, Hubert
Saranti, Anna
Biecek, Przemyslaw
Holzinger, Andreas
author_facet Pfeifer, Bastian
Baniecki, Hubert
Saranti, Anna
Biecek, Przemyslaw
Holzinger, Andreas
author_sort Pfeifer, Bastian
collection PubMed
description Machine learning methods can detect complex relationships between variables, but usually do not exploit domain knowledge. This is a limitation because in many scientific disciplines, such as systems biology, domain knowledge is available in the form of graphs or networks, and its use can improve model performance. We need network-based algorithms that are versatile and applicable in many research areas. In this work, we demonstrate subnetwork detection based on multi-modal node features using a novel Greedy Decision Forest (GDF) with inherent interpretability. The latter will be a crucial factor to retain experts and gain their trust in such algorithms. To demonstrate a concrete application example, we focus on bioinformatics, systems biology and particularly biomedicine, but the presented methodology is applicable in many other domains as well. Systems biology is a good example of a field in which statistical data-driven machine learning enables the analysis of large amounts of multi-modal biomedical data. This is important to reach the future goal of precision medicine, where the complexity of patients is modeled on a system level to best tailor medical decisions, health practices and therapies to the individual patient. Our proposed explainable approach can help to uncover disease-causing network modules from multi-omics data to better understand complex diseases such as cancer.
format Online
Article
Text
id pubmed-9546860
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-95468602022-10-09 Multi-omics disease module detection with an explainable Greedy Decision Forest Pfeifer, Bastian Baniecki, Hubert Saranti, Anna Biecek, Przemyslaw Holzinger, Andreas Sci Rep Article Machine learning methods can detect complex relationships between variables, but usually do not exploit domain knowledge. This is a limitation because in many scientific disciplines, such as systems biology, domain knowledge is available in the form of graphs or networks, and its use can improve model performance. We need network-based algorithms that are versatile and applicable in many research areas. In this work, we demonstrate subnetwork detection based on multi-modal node features using a novel Greedy Decision Forest (GDF) with inherent interpretability. The latter will be a crucial factor to retain experts and gain their trust in such algorithms. To demonstrate a concrete application example, we focus on bioinformatics, systems biology and particularly biomedicine, but the presented methodology is applicable in many other domains as well. Systems biology is a good example of a field in which statistical data-driven machine learning enables the analysis of large amounts of multi-modal biomedical data. This is important to reach the future goal of precision medicine, where the complexity of patients is modeled on a system level to best tailor medical decisions, health practices and therapies to the individual patient. Our proposed explainable approach can help to uncover disease-causing network modules from multi-omics data to better understand complex diseases such as cancer. Nature Publishing Group UK 2022-10-07 /pmc/articles/PMC9546860/ /pubmed/36207536 http://dx.doi.org/10.1038/s41598-022-21417-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Pfeifer, Bastian
Baniecki, Hubert
Saranti, Anna
Biecek, Przemyslaw
Holzinger, Andreas
Multi-omics disease module detection with an explainable Greedy Decision Forest
title Multi-omics disease module detection with an explainable Greedy Decision Forest
title_full Multi-omics disease module detection with an explainable Greedy Decision Forest
title_fullStr Multi-omics disease module detection with an explainable Greedy Decision Forest
title_full_unstemmed Multi-omics disease module detection with an explainable Greedy Decision Forest
title_short Multi-omics disease module detection with an explainable Greedy Decision Forest
title_sort multi-omics disease module detection with an explainable greedy decision forest
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9546860/
https://www.ncbi.nlm.nih.gov/pubmed/36207536
http://dx.doi.org/10.1038/s41598-022-21417-8
work_keys_str_mv AT pfeiferbastian multiomicsdiseasemoduledetectionwithanexplainablegreedydecisionforest
AT banieckihubert multiomicsdiseasemoduledetectionwithanexplainablegreedydecisionforest
AT sarantianna multiomicsdiseasemoduledetectionwithanexplainablegreedydecisionforest
AT biecekprzemyslaw multiomicsdiseasemoduledetectionwithanexplainablegreedydecisionforest
AT holzingerandreas multiomicsdiseasemoduledetectionwithanexplainablegreedydecisionforest