Cargando…

On the Use of Topological Features of Metabolic Networks for the Classification of Cancer Samples

BACKGROUND: The increasing availability of omics data collected from patients affected by severe pathologies, such as cancer, is fostering the development of data science methods for their analysis. INTRODUCTION: The combination of data integration and machine learning approaches can provide new pow...

Descripción completa

Detalles Bibliográficos
Autores principales: Machicao, Jeaneth, Craighero, Francesco, Maspero, Davide, Angaroni, Fabrizio, Damiani, Chiara, Graudenzi, Alex, Antoniotti, Marco, Bruno, Odemir M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Bentham Science Publishers 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8188584/
https://www.ncbi.nlm.nih.gov/pubmed/34220296
http://dx.doi.org/10.2174/1389202922666210301084151
_version_ 1783705359520628736
author Machicao, Jeaneth
Craighero, Francesco
Maspero, Davide
Angaroni, Fabrizio
Damiani, Chiara
Graudenzi, Alex
Antoniotti, Marco
Bruno, Odemir M.
author_facet Machicao, Jeaneth
Craighero, Francesco
Maspero, Davide
Angaroni, Fabrizio
Damiani, Chiara
Graudenzi, Alex
Antoniotti, Marco
Bruno, Odemir M.
author_sort Machicao, Jeaneth
collection PubMed
description BACKGROUND: The increasing availability of omics data collected from patients affected by severe pathologies, such as cancer, is fostering the development of data science methods for their analysis. INTRODUCTION: The combination of data integration and machine learning approaches can provide new powerful instruments to tackle the complexity of cancer development and deliver effective diagnostic and prognostic strategies. METHODS: We explore the possibility of exploiting the topological properties of sample-specific metabolic networks as features in a supervised classification task. Such networks are obtained by projecting transcriptomic data from RNA-seq experiments on genome-wide metabolic models to define weighted networks modeling the overall metabolic activity of a given sample. RESULTS: We show the classification results on a labeled breast cancer dataset from the TCGA database, including 210 samples (cancer vs. normal). In particular, we investigate how the performance is affected by a threshold-based pruning of the networks by comparing Artificial Neural Networks, Support Vector Machines and Random Forests. Interestingly, the best classification performance is achieved within a small threshold range for all methods, suggesting that it might represent an effective choice to recover useful information while filtering out noise from data. Overall, the best accuracy is achieved with SVMs, which exhibit performances similar to those obtained when gene expression profiles are used as features. CONCLUSION: These findings demonstrate that the topological properties of sample-specific metabolic networks are effective in classifying cancer and normal samples, suggesting that useful information can be extracted from a relatively limited number of features.
format Online
Article
Text
id pubmed-8188584
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Bentham Science Publishers
record_format MEDLINE/PubMed
spelling pubmed-81885842021-08-01 On the Use of Topological Features of Metabolic Networks for the Classification of Cancer Samples Machicao, Jeaneth Craighero, Francesco Maspero, Davide Angaroni, Fabrizio Damiani, Chiara Graudenzi, Alex Antoniotti, Marco Bruno, Odemir M. Curr Genomics Article BACKGROUND: The increasing availability of omics data collected from patients affected by severe pathologies, such as cancer, is fostering the development of data science methods for their analysis. INTRODUCTION: The combination of data integration and machine learning approaches can provide new powerful instruments to tackle the complexity of cancer development and deliver effective diagnostic and prognostic strategies. METHODS: We explore the possibility of exploiting the topological properties of sample-specific metabolic networks as features in a supervised classification task. Such networks are obtained by projecting transcriptomic data from RNA-seq experiments on genome-wide metabolic models to define weighted networks modeling the overall metabolic activity of a given sample. RESULTS: We show the classification results on a labeled breast cancer dataset from the TCGA database, including 210 samples (cancer vs. normal). In particular, we investigate how the performance is affected by a threshold-based pruning of the networks by comparing Artificial Neural Networks, Support Vector Machines and Random Forests. Interestingly, the best classification performance is achieved within a small threshold range for all methods, suggesting that it might represent an effective choice to recover useful information while filtering out noise from data. Overall, the best accuracy is achieved with SVMs, which exhibit performances similar to those obtained when gene expression profiles are used as features. CONCLUSION: These findings demonstrate that the topological properties of sample-specific metabolic networks are effective in classifying cancer and normal samples, suggesting that useful information can be extracted from a relatively limited number of features. Bentham Science Publishers 2021-02 2021-02 /pmc/articles/PMC8188584/ /pubmed/34220296 http://dx.doi.org/10.2174/1389202922666210301084151 Text en © 2021 Bentham Science Publishers https://creativecommons.org/licenses/by-nc/4.0/ This is an open access article licensed under the terms of the Creative Commons Attribution-Non-Commercial 4.0 International Public License (CC BY-NC 4.0) (https://creativecommons.org/licenses/by-nc/4.0/legalcode), which permits unrestricted, non-commercial use, distribution and reproduction in any medium, provided the work is properly cited.
spellingShingle Article
Machicao, Jeaneth
Craighero, Francesco
Maspero, Davide
Angaroni, Fabrizio
Damiani, Chiara
Graudenzi, Alex
Antoniotti, Marco
Bruno, Odemir M.
On the Use of Topological Features of Metabolic Networks for the Classification of Cancer Samples
title On the Use of Topological Features of Metabolic Networks for the Classification of Cancer Samples
title_full On the Use of Topological Features of Metabolic Networks for the Classification of Cancer Samples
title_fullStr On the Use of Topological Features of Metabolic Networks for the Classification of Cancer Samples
title_full_unstemmed On the Use of Topological Features of Metabolic Networks for the Classification of Cancer Samples
title_short On the Use of Topological Features of Metabolic Networks for the Classification of Cancer Samples
title_sort on the use of topological features of metabolic networks for the classification of cancer samples
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8188584/
https://www.ncbi.nlm.nih.gov/pubmed/34220296
http://dx.doi.org/10.2174/1389202922666210301084151
work_keys_str_mv AT machicaojeaneth ontheuseoftopologicalfeaturesofmetabolicnetworksfortheclassificationofcancersamples
AT craigherofrancesco ontheuseoftopologicalfeaturesofmetabolicnetworksfortheclassificationofcancersamples
AT masperodavide ontheuseoftopologicalfeaturesofmetabolicnetworksfortheclassificationofcancersamples
AT angaronifabrizio ontheuseoftopologicalfeaturesofmetabolicnetworksfortheclassificationofcancersamples
AT damianichiara ontheuseoftopologicalfeaturesofmetabolicnetworksfortheclassificationofcancersamples
AT graudenzialex ontheuseoftopologicalfeaturesofmetabolicnetworksfortheclassificationofcancersamples
AT antoniottimarco ontheuseoftopologicalfeaturesofmetabolicnetworksfortheclassificationofcancersamples
AT brunoodemirm ontheuseoftopologicalfeaturesofmetabolicnetworksfortheclassificationofcancersamples