Cargando…

Supervised learning methods in modeling of CD4+ T cell heterogeneity

BACKGROUND: Modeling of the immune system – a highly non-linear and complex system – requires practical and efficient data analytic approaches. The immune system is composed of heterogeneous cell populations and hundreds of cell types, such as neutrophils, eosinophils, macrophages, dendritic cells,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lu, Pinyi, Abedi, Vida, Mei, Yongguo, Hontecillas, Raquel, Hoops, Stefan, Carbo, Adria, Bassaganya-Riera, Josep
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2015
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559362/ https://www.ncbi.nlm.nih.gov/pubmed/26339293 http://dx.doi.org/10.1186/s13040-015-0060-6

_version_	1782388761659703296
author	Lu, Pinyi Abedi, Vida Mei, Yongguo Hontecillas, Raquel Hoops, Stefan Carbo, Adria Bassaganya-Riera, Josep
author_facet	Lu, Pinyi Abedi, Vida Mei, Yongguo Hontecillas, Raquel Hoops, Stefan Carbo, Adria Bassaganya-Riera, Josep
author_sort	Lu, Pinyi
collection	PubMed
description	BACKGROUND: Modeling of the immune system – a highly non-linear and complex system – requires practical and efficient data analytic approaches. The immune system is composed of heterogeneous cell populations and hundreds of cell types, such as neutrophils, eosinophils, macrophages, dendritic cells, T cells, and B cells. Each cell type is highly diverse and can be further differentiated into subsets with unique and overlapping functions. For example, CD4+ T cells can be differentiated into Th1, Th2, Th17, Th9, Th22, Treg, Tfh, as well as Tr1. Each subset plays different roles in the immune system. To study molecular mechanisms of cell differentiation, computational systems biology approaches can be used to represent these processes; however, the latter often requires building complex intracellular signaling models with a large number of equations to accurately represent intracellular pathways and biochemical reactions. Furthermore, studying the immune system entails integration of complex processes which occur at different time and space scales. METHODS: This study presents and compares four supervised learning methods for modeling CD4+ T cell differentiation: Artificial Neural Networks (ANN), Random Forest (RF), Support Vector Machines (SVM), and Linear Regression (LR). Application of supervised learning methods could reduce the complexity of Ordinary Differential Equations (ODEs)-based intracellular models by only focusing on the input and output cytokine concentrations. In addition, this modeling framework can be efficiently integrated into multiscale models. RESULTS: Our results demonstrate that ANN and RF outperform the other two methods. Furthermore, ANN and RF have comparable performance when applied to in silico data with and without added noise. The trained models were also able to reproduce dynamic behavior when applied to experimental data; in four out of five cases, model predictions based on ANN and RF correctly predicted the outcome of the system. Finally, the running time of different methods was compared, which confirms that ANN is considerably faster than RF. CONCLUSIONS: Using machine learning as opposed to ODE-based method reduces the computational complexity of the system and allows one to gain a deeper understanding of the complex interplay between the different related entities.
format	Online Article Text
id	pubmed-4559362
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-45593622015-09-04 Supervised learning methods in modeling of CD4+ T cell heterogeneity Lu, Pinyi Abedi, Vida Mei, Yongguo Hontecillas, Raquel Hoops, Stefan Carbo, Adria Bassaganya-Riera, Josep BioData Min Research BACKGROUND: Modeling of the immune system – a highly non-linear and complex system – requires practical and efficient data analytic approaches. The immune system is composed of heterogeneous cell populations and hundreds of cell types, such as neutrophils, eosinophils, macrophages, dendritic cells, T cells, and B cells. Each cell type is highly diverse and can be further differentiated into subsets with unique and overlapping functions. For example, CD4+ T cells can be differentiated into Th1, Th2, Th17, Th9, Th22, Treg, Tfh, as well as Tr1. Each subset plays different roles in the immune system. To study molecular mechanisms of cell differentiation, computational systems biology approaches can be used to represent these processes; however, the latter often requires building complex intracellular signaling models with a large number of equations to accurately represent intracellular pathways and biochemical reactions. Furthermore, studying the immune system entails integration of complex processes which occur at different time and space scales. METHODS: This study presents and compares four supervised learning methods for modeling CD4+ T cell differentiation: Artificial Neural Networks (ANN), Random Forest (RF), Support Vector Machines (SVM), and Linear Regression (LR). Application of supervised learning methods could reduce the complexity of Ordinary Differential Equations (ODEs)-based intracellular models by only focusing on the input and output cytokine concentrations. In addition, this modeling framework can be efficiently integrated into multiscale models. RESULTS: Our results demonstrate that ANN and RF outperform the other two methods. Furthermore, ANN and RF have comparable performance when applied to in silico data with and without added noise. The trained models were also able to reproduce dynamic behavior when applied to experimental data; in four out of five cases, model predictions based on ANN and RF correctly predicted the outcome of the system. Finally, the running time of different methods was compared, which confirms that ANN is considerably faster than RF. CONCLUSIONS: Using machine learning as opposed to ODE-based method reduces the computational complexity of the system and allows one to gain a deeper understanding of the complex interplay between the different related entities. BioMed Central 2015-09-04 /pmc/articles/PMC4559362/ /pubmed/26339293 http://dx.doi.org/10.1186/s13040-015-0060-6 Text en © Lu et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Lu, Pinyi Abedi, Vida Mei, Yongguo Hontecillas, Raquel Hoops, Stefan Carbo, Adria Bassaganya-Riera, Josep Supervised learning methods in modeling of CD4+ T cell heterogeneity
title	Supervised learning methods in modeling of CD4+ T cell heterogeneity
title_full	Supervised learning methods in modeling of CD4+ T cell heterogeneity
title_fullStr	Supervised learning methods in modeling of CD4+ T cell heterogeneity
title_full_unstemmed	Supervised learning methods in modeling of CD4+ T cell heterogeneity
title_short	Supervised learning methods in modeling of CD4+ T cell heterogeneity
title_sort	supervised learning methods in modeling of cd4+ t cell heterogeneity
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559362/ https://www.ncbi.nlm.nih.gov/pubmed/26339293 http://dx.doi.org/10.1186/s13040-015-0060-6
work_keys_str_mv	AT lupinyi supervisedlearningmethodsinmodelingofcd4tcellheterogeneity AT abedivida supervisedlearningmethodsinmodelingofcd4tcellheterogeneity AT meiyongguo supervisedlearningmethodsinmodelingofcd4tcellheterogeneity AT hontecillasraquel supervisedlearningmethodsinmodelingofcd4tcellheterogeneity AT hoopsstefan supervisedlearningmethodsinmodelingofcd4tcellheterogeneity AT carboadria supervisedlearningmethodsinmodelingofcd4tcellheterogeneity AT bassaganyarierajosep supervisedlearningmethodsinmodelingofcd4tcellheterogeneity

Supervised learning methods in modeling of CD4+ T cell heterogeneity

Ejemplares similares