Cargando…
Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data
The constantly and rapidly increasing amount of the biological data gained from many different high-throughput experiments opens up new possibilities for data- and model-driven inference. Yet, alongside, emerges a problem of risks related to data integration techniques. The latter are not so widely...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7712986/ https://www.ncbi.nlm.nih.gov/pubmed/33287006 http://dx.doi.org/10.3390/e22111238 |
_version_ | 1783618491587231744 |
---|---|
author | Gogolewski, Krzysztof Kostecki, Marcin Gambin, Anna |
author_facet | Gogolewski, Krzysztof Kostecki, Marcin Gambin, Anna |
author_sort | Gogolewski, Krzysztof |
collection | PubMed |
description | The constantly and rapidly increasing amount of the biological data gained from many different high-throughput experiments opens up new possibilities for data- and model-driven inference. Yet, alongside, emerges a problem of risks related to data integration techniques. The latter are not so widely taken account of. Especially, the approaches based on the flux balance analysis (FBA) are sensitive to the structure of a metabolic network for which the low-entropy clusters can prevent the inference from the activity of the metabolic reactions. In the following article, we set forth problems that may arise during the integration of metabolomic data with gene expression datasets. We analyze common pitfalls, provide their possible solutions, and exemplify them by a case study of the renal cell carcinoma (RCC). Using the proposed approach we provide a metabolic description of the known morphological RCC subtypes and suggest a possible existence of the poor-prognosis cluster of patients, which are commonly characterized by the low activity of the drug transporting enzymes crucial in the chemotherapy. This discovery suits and extends the already known poor-prognosis characteristics of RCC. Finally, the goal of this work is also to point out the problem that arises from the integration of high-throughput data with the inherently nonuniform, manually curated low-throughput data. In such cases, the over-represented information may potentially overshadow the non-trivial discoveries. |
format | Online Article Text |
id | pubmed-7712986 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-77129862021-02-24 Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data Gogolewski, Krzysztof Kostecki, Marcin Gambin, Anna Entropy (Basel) Article The constantly and rapidly increasing amount of the biological data gained from many different high-throughput experiments opens up new possibilities for data- and model-driven inference. Yet, alongside, emerges a problem of risks related to data integration techniques. The latter are not so widely taken account of. Especially, the approaches based on the flux balance analysis (FBA) are sensitive to the structure of a metabolic network for which the low-entropy clusters can prevent the inference from the activity of the metabolic reactions. In the following article, we set forth problems that may arise during the integration of metabolomic data with gene expression datasets. We analyze common pitfalls, provide their possible solutions, and exemplify them by a case study of the renal cell carcinoma (RCC). Using the proposed approach we provide a metabolic description of the known morphological RCC subtypes and suggest a possible existence of the poor-prognosis cluster of patients, which are commonly characterized by the low activity of the drug transporting enzymes crucial in the chemotherapy. This discovery suits and extends the already known poor-prognosis characteristics of RCC. Finally, the goal of this work is also to point out the problem that arises from the integration of high-throughput data with the inherently nonuniform, manually curated low-throughput data. In such cases, the over-represented information may potentially overshadow the non-trivial discoveries. MDPI 2020-10-31 /pmc/articles/PMC7712986/ /pubmed/33287006 http://dx.doi.org/10.3390/e22111238 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Gogolewski, Krzysztof Kostecki, Marcin Gambin, Anna Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data |
title | Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data |
title_full | Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data |
title_fullStr | Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data |
title_full_unstemmed | Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data |
title_short | Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data |
title_sort | low entropy sub-networks prevent the integration of metabolomic and transcriptomic data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7712986/ https://www.ncbi.nlm.nih.gov/pubmed/33287006 http://dx.doi.org/10.3390/e22111238 |
work_keys_str_mv | AT gogolewskikrzysztof lowentropysubnetworkspreventtheintegrationofmetabolomicandtranscriptomicdata AT kosteckimarcin lowentropysubnetworkspreventtheintegrationofmetabolomicandtranscriptomicdata AT gambinanna lowentropysubnetworkspreventtheintegrationofmetabolomicandtranscriptomicdata |