Cargando…

Modeling and Simulation of Cell Signaling Networks for Subsequent Analytics Processes Using Big Data and Machine Learning

This work explores how much the traditional approach to modeling and simulation of biological systems, specifically cell signaling networks, can be increased and improved by integrating big data, data mining, and machine learning techniques. Specifically, we first model, simulate, validate, and cali...

Descripción completa

Detalles Bibliográficos
Autores principales: Sánchez-Gutiérrez, Máximo Eduardo, González-Pérez, Pedro Pablo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9036331/
https://www.ncbi.nlm.nih.gov/pubmed/35478994
http://dx.doi.org/10.1177/11779322221091739
_version_ 1784693500956114944
author Sánchez-Gutiérrez, Máximo Eduardo
González-Pérez, Pedro Pablo
author_facet Sánchez-Gutiérrez, Máximo Eduardo
González-Pérez, Pedro Pablo
author_sort Sánchez-Gutiérrez, Máximo Eduardo
collection PubMed
description This work explores how much the traditional approach to modeling and simulation of biological systems, specifically cell signaling networks, can be increased and improved by integrating big data, data mining, and machine learning techniques. Specifically, we first model, simulate, validate, and calibrate the behavior of the PI3K/AKT/mTOR cancer-related signaling pathway. Subsequently, once the behavior of the simulated signaling network matches the expected behavior, the capacity of the computational simulation is increased to grow data (data farming). First, we use big data techniques to extract, collect, filter, and store large volumes of data describing all the interactions among the simulated cell signaling system components over time. Afterward, we apply data mining and machine learning techniques—specifically, exploratory data analysis, feature selection techniques, and supervised neural network models—to the resulting biological dataset to obtain new inferences and knowledge about this biological system. The results showed how the traditional approach to the simulation of biological systems could be enhanced and improved by incorporating big data, data mining, and machine learning techniques, which significantly contributed to increasing the predictive power of the simulation.
format Online
Article
Text
id pubmed-9036331
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-90363312022-04-26 Modeling and Simulation of Cell Signaling Networks for Subsequent Analytics Processes Using Big Data and Machine Learning Sánchez-Gutiérrez, Máximo Eduardo González-Pérez, Pedro Pablo Bioinform Biol Insights Original Research This work explores how much the traditional approach to modeling and simulation of biological systems, specifically cell signaling networks, can be increased and improved by integrating big data, data mining, and machine learning techniques. Specifically, we first model, simulate, validate, and calibrate the behavior of the PI3K/AKT/mTOR cancer-related signaling pathway. Subsequently, once the behavior of the simulated signaling network matches the expected behavior, the capacity of the computational simulation is increased to grow data (data farming). First, we use big data techniques to extract, collect, filter, and store large volumes of data describing all the interactions among the simulated cell signaling system components over time. Afterward, we apply data mining and machine learning techniques—specifically, exploratory data analysis, feature selection techniques, and supervised neural network models—to the resulting biological dataset to obtain new inferences and knowledge about this biological system. The results showed how the traditional approach to the simulation of biological systems could be enhanced and improved by incorporating big data, data mining, and machine learning techniques, which significantly contributed to increasing the predictive power of the simulation. SAGE Publications 2022-04-22 /pmc/articles/PMC9036331/ /pubmed/35478994 http://dx.doi.org/10.1177/11779322221091739 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Sánchez-Gutiérrez, Máximo Eduardo
González-Pérez, Pedro Pablo
Modeling and Simulation of Cell Signaling Networks for Subsequent Analytics Processes Using Big Data and Machine Learning
title Modeling and Simulation of Cell Signaling Networks for Subsequent Analytics Processes Using Big Data and Machine Learning
title_full Modeling and Simulation of Cell Signaling Networks for Subsequent Analytics Processes Using Big Data and Machine Learning
title_fullStr Modeling and Simulation of Cell Signaling Networks for Subsequent Analytics Processes Using Big Data and Machine Learning
title_full_unstemmed Modeling and Simulation of Cell Signaling Networks for Subsequent Analytics Processes Using Big Data and Machine Learning
title_short Modeling and Simulation of Cell Signaling Networks for Subsequent Analytics Processes Using Big Data and Machine Learning
title_sort modeling and simulation of cell signaling networks for subsequent analytics processes using big data and machine learning
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9036331/
https://www.ncbi.nlm.nih.gov/pubmed/35478994
http://dx.doi.org/10.1177/11779322221091739
work_keys_str_mv AT sanchezgutierrezmaximoeduardo modelingandsimulationofcellsignalingnetworksforsubsequentanalyticsprocessesusingbigdataandmachinelearning
AT gonzalezperezpedropablo modelingandsimulationofcellsignalingnetworksforsubsequentanalyticsprocessesusingbigdataandmachinelearning