Cargando…

De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet

BACKGROUND: With a growing amount of (multi-)omics data being available, the extraction of knowledge from these datasets is still a difficult problem. Classical enrichment-style analyses require predefined pathways or gene sets that are tested for significant deregulation to assess whether the pathw...

Descripción completa

Detalles Bibliográficos
Autores principales: Winkler, Sebastian, Winkler, Ivana, Figaschewski, Mirjam, Tiede, Thorsten, Nordheim, Alfred, Kohlbacher, Oliver
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9020058/
https://www.ncbi.nlm.nih.gov/pubmed/35439941
http://dx.doi.org/10.1186/s12859-022-04670-6
_version_ 1784689448035811328
author Winkler, Sebastian
Winkler, Ivana
Figaschewski, Mirjam
Tiede, Thorsten
Nordheim, Alfred
Kohlbacher, Oliver
author_facet Winkler, Sebastian
Winkler, Ivana
Figaschewski, Mirjam
Tiede, Thorsten
Nordheim, Alfred
Kohlbacher, Oliver
author_sort Winkler, Sebastian
collection PubMed
description BACKGROUND: With a growing amount of (multi-)omics data being available, the extraction of knowledge from these datasets is still a difficult problem. Classical enrichment-style analyses require predefined pathways or gene sets that are tested for significant deregulation to assess whether the pathway is functionally involved in the biological process under study. De novo identification of these pathways can reduce the bias inherent in predefined pathways or gene sets. At the same time, the definition and efficient identification of these pathways de novo from large biological networks is a challenging problem. RESULTS: We present a novel algorithm, DeRegNet, for the identification of maximally deregulated subnetworks on directed graphs based on deregulation scores derived from (multi-)omics data. DeRegNet can be interpreted as maximum likelihood estimation given a certain probabilistic model for de-novo subgraph identification. We use fractional integer programming to solve the resulting combinatorial optimization problem. We can show that the approach outperforms related algorithms on simulated data with known ground truths. On a publicly available liver cancer dataset we can show that DeRegNet can identify biologically meaningful subgraphs suitable for patient stratification. DeRegNet can also be used to find explicitly multi-omics subgraphs which we demonstrate by presenting subgraphs with consistent methylation-transcription patterns. DeRegNet is freely available as open-source software. CONCLUSION: The proposed algorithmic framework and its available implementation can serve as a valuable heuristic hypothesis generation tool contextualizing omics data within biomolecular networks.
format Online
Article
Text
id pubmed-9020058
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90200582022-04-21 De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet Winkler, Sebastian Winkler, Ivana Figaschewski, Mirjam Tiede, Thorsten Nordheim, Alfred Kohlbacher, Oliver BMC Bioinformatics Research BACKGROUND: With a growing amount of (multi-)omics data being available, the extraction of knowledge from these datasets is still a difficult problem. Classical enrichment-style analyses require predefined pathways or gene sets that are tested for significant deregulation to assess whether the pathway is functionally involved in the biological process under study. De novo identification of these pathways can reduce the bias inherent in predefined pathways or gene sets. At the same time, the definition and efficient identification of these pathways de novo from large biological networks is a challenging problem. RESULTS: We present a novel algorithm, DeRegNet, for the identification of maximally deregulated subnetworks on directed graphs based on deregulation scores derived from (multi-)omics data. DeRegNet can be interpreted as maximum likelihood estimation given a certain probabilistic model for de-novo subgraph identification. We use fractional integer programming to solve the resulting combinatorial optimization problem. We can show that the approach outperforms related algorithms on simulated data with known ground truths. On a publicly available liver cancer dataset we can show that DeRegNet can identify biologically meaningful subgraphs suitable for patient stratification. DeRegNet can also be used to find explicitly multi-omics subgraphs which we demonstrate by presenting subgraphs with consistent methylation-transcription patterns. DeRegNet is freely available as open-source software. CONCLUSION: The proposed algorithmic framework and its available implementation can serve as a valuable heuristic hypothesis generation tool contextualizing omics data within biomolecular networks. BioMed Central 2022-04-19 /pmc/articles/PMC9020058/ /pubmed/35439941 http://dx.doi.org/10.1186/s12859-022-04670-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Winkler, Sebastian
Winkler, Ivana
Figaschewski, Mirjam
Tiede, Thorsten
Nordheim, Alfred
Kohlbacher, Oliver
De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet
title De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet
title_full De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet
title_fullStr De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet
title_full_unstemmed De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet
title_short De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet
title_sort de novo identification of maximally deregulated subnetworks based on multi-omics data with deregnet
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9020058/
https://www.ncbi.nlm.nih.gov/pubmed/35439941
http://dx.doi.org/10.1186/s12859-022-04670-6
work_keys_str_mv AT winklersebastian denovoidentificationofmaximallyderegulatedsubnetworksbasedonmultiomicsdatawithderegnet
AT winklerivana denovoidentificationofmaximallyderegulatedsubnetworksbasedonmultiomicsdatawithderegnet
AT figaschewskimirjam denovoidentificationofmaximallyderegulatedsubnetworksbasedonmultiomicsdatawithderegnet
AT tiedethorsten denovoidentificationofmaximallyderegulatedsubnetworksbasedonmultiomicsdatawithderegnet
AT nordheimalfred denovoidentificationofmaximallyderegulatedsubnetworksbasedonmultiomicsdatawithderegnet
AT kohlbacheroliver denovoidentificationofmaximallyderegulatedsubnetworksbasedonmultiomicsdatawithderegnet