Cargando…

Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data

The constantly and rapidly increasing amount of the biological data gained from many different high-throughput experiments opens up new possibilities for data- and model-driven inference. Yet, alongside, emerges a problem of risks related to data integration techniques. The latter are not so widely...

Descripción completa

Detalles Bibliográficos
Autores principales: Gogolewski, Krzysztof, Kostecki, Marcin, Gambin, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7712986/
https://www.ncbi.nlm.nih.gov/pubmed/33287006
http://dx.doi.org/10.3390/e22111238
_version_ 1783618491587231744
author Gogolewski, Krzysztof
Kostecki, Marcin
Gambin, Anna
author_facet Gogolewski, Krzysztof
Kostecki, Marcin
Gambin, Anna
author_sort Gogolewski, Krzysztof
collection PubMed
description The constantly and rapidly increasing amount of the biological data gained from many different high-throughput experiments opens up new possibilities for data- and model-driven inference. Yet, alongside, emerges a problem of risks related to data integration techniques. The latter are not so widely taken account of. Especially, the approaches based on the flux balance analysis (FBA) are sensitive to the structure of a metabolic network for which the low-entropy clusters can prevent the inference from the activity of the metabolic reactions. In the following article, we set forth problems that may arise during the integration of metabolomic data with gene expression datasets. We analyze common pitfalls, provide their possible solutions, and exemplify them by a case study of the renal cell carcinoma (RCC). Using the proposed approach we provide a metabolic description of the known morphological RCC subtypes and suggest a possible existence of the poor-prognosis cluster of patients, which are commonly characterized by the low activity of the drug transporting enzymes crucial in the chemotherapy. This discovery suits and extends the already known poor-prognosis characteristics of RCC. Finally, the goal of this work is also to point out the problem that arises from the integration of high-throughput data with the inherently nonuniform, manually curated low-throughput data. In such cases, the over-represented information may potentially overshadow the non-trivial discoveries.
format Online
Article
Text
id pubmed-7712986
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-77129862021-02-24 Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data Gogolewski, Krzysztof Kostecki, Marcin Gambin, Anna Entropy (Basel) Article The constantly and rapidly increasing amount of the biological data gained from many different high-throughput experiments opens up new possibilities for data- and model-driven inference. Yet, alongside, emerges a problem of risks related to data integration techniques. The latter are not so widely taken account of. Especially, the approaches based on the flux balance analysis (FBA) are sensitive to the structure of a metabolic network for which the low-entropy clusters can prevent the inference from the activity of the metabolic reactions. In the following article, we set forth problems that may arise during the integration of metabolomic data with gene expression datasets. We analyze common pitfalls, provide their possible solutions, and exemplify them by a case study of the renal cell carcinoma (RCC). Using the proposed approach we provide a metabolic description of the known morphological RCC subtypes and suggest a possible existence of the poor-prognosis cluster of patients, which are commonly characterized by the low activity of the drug transporting enzymes crucial in the chemotherapy. This discovery suits and extends the already known poor-prognosis characteristics of RCC. Finally, the goal of this work is also to point out the problem that arises from the integration of high-throughput data with the inherently nonuniform, manually curated low-throughput data. In such cases, the over-represented information may potentially overshadow the non-trivial discoveries. MDPI 2020-10-31 /pmc/articles/PMC7712986/ /pubmed/33287006 http://dx.doi.org/10.3390/e22111238 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gogolewski, Krzysztof
Kostecki, Marcin
Gambin, Anna
Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data
title Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data
title_full Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data
title_fullStr Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data
title_full_unstemmed Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data
title_short Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data
title_sort low entropy sub-networks prevent the integration of metabolomic and transcriptomic data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7712986/
https://www.ncbi.nlm.nih.gov/pubmed/33287006
http://dx.doi.org/10.3390/e22111238
work_keys_str_mv AT gogolewskikrzysztof lowentropysubnetworkspreventtheintegrationofmetabolomicandtranscriptomicdata
AT kosteckimarcin lowentropysubnetworkspreventtheintegrationofmetabolomicandtranscriptomicdata
AT gambinanna lowentropysubnetworkspreventtheintegrationofmetabolomicandtranscriptomicdata