Cargando…

Classification of dendritic cell phenotypes from gene expression data

BACKGROUND: The selection of relevant genes for sample classification is a common task in many gene expression studies. Although a number of tools have been developed to identify optimal gene expression signatures, they often generate gene lists that are too long to be exploited clinically. Conseque...

Descripción completa

Detalles Bibliográficos
Autores principales: Tuana, Giacomo, Volpato, Viola, Ricciardi-Castagnoli, Paola, Zolezzi, Francesca, Stella, Fabio, Foti, Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179938/
https://www.ncbi.nlm.nih.gov/pubmed/21875438
http://dx.doi.org/10.1186/1471-2172-12-50
_version_ 1782212571705638912
author Tuana, Giacomo
Volpato, Viola
Ricciardi-Castagnoli, Paola
Zolezzi, Francesca
Stella, Fabio
Foti, Maria
author_facet Tuana, Giacomo
Volpato, Viola
Ricciardi-Castagnoli, Paola
Zolezzi, Francesca
Stella, Fabio
Foti, Maria
author_sort Tuana, Giacomo
collection PubMed
description BACKGROUND: The selection of relevant genes for sample classification is a common task in many gene expression studies. Although a number of tools have been developed to identify optimal gene expression signatures, they often generate gene lists that are too long to be exploited clinically. Consequently, researchers in the field try to identify the smallest set of genes that provide good sample classification. We investigated the genome-wide expression of the inflammatory phenotype in dendritic cells. Dendritic cells are a complex group of cells that play a critical role in vertebrate immunity. Therefore, the prediction of the inflammatory phenotype in these cells may help with the selection of immune-modulating compounds. RESULTS: A data mining protocol was applied to microarray data for murine cell lines treated with various inflammatory stimuli. The learning and validation data sets consisted of 155 and 49 samples, respectively. The data mining protocol reduced the number of probe sets from 5,802 to 10, then from 10 to 6 and finally from 6 to 3. The performances of a set of supervised classification models were compared. The best accuracy, when using the six following genes --Il12b, Cd40, Socs3, Irgm1, Plin2 and Lgals3bp-- was obtained by Tree Augmented Naïve Bayes and Nearest Neighbour (91.8%). Using the smallest set of three genes --Il12b, Cd40 and Socs3-- the performance remained satisfactory and the best accuracy was with Support Vector Machine (95.9%). These data mining models, using data for the genes Il12b, Cd40 and Socs3, were validated with a human data set consisting of 27 samples. Support Vector Machines (71.4%) and Nearest Neighbour (92.6%) gave the worst performances, but the remaining models correctly classified all the 27 samples. CONCLUSIONS: The genes selected by the data mining protocol proposed were shown to be informative for discriminating between inflammatory and steady-state phenotypes in dendritic cells. The robustness of the data mining protocol was confirmed by the accuracy for a human data set, when using only the following three genes: Il12b, Cd40 and Socs3. In summary, we analysed the longitudinal pattern of expression in dendritic cells stimulated with activating agents with the aim of identifying signatures that would predict or explain the dentritic cell response to an inflammatory agent.
format Online
Article
Text
id pubmed-3179938
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31799382011-09-26 Classification of dendritic cell phenotypes from gene expression data Tuana, Giacomo Volpato, Viola Ricciardi-Castagnoli, Paola Zolezzi, Francesca Stella, Fabio Foti, Maria BMC Immunol Research Article BACKGROUND: The selection of relevant genes for sample classification is a common task in many gene expression studies. Although a number of tools have been developed to identify optimal gene expression signatures, they often generate gene lists that are too long to be exploited clinically. Consequently, researchers in the field try to identify the smallest set of genes that provide good sample classification. We investigated the genome-wide expression of the inflammatory phenotype in dendritic cells. Dendritic cells are a complex group of cells that play a critical role in vertebrate immunity. Therefore, the prediction of the inflammatory phenotype in these cells may help with the selection of immune-modulating compounds. RESULTS: A data mining protocol was applied to microarray data for murine cell lines treated with various inflammatory stimuli. The learning and validation data sets consisted of 155 and 49 samples, respectively. The data mining protocol reduced the number of probe sets from 5,802 to 10, then from 10 to 6 and finally from 6 to 3. The performances of a set of supervised classification models were compared. The best accuracy, when using the six following genes --Il12b, Cd40, Socs3, Irgm1, Plin2 and Lgals3bp-- was obtained by Tree Augmented Naïve Bayes and Nearest Neighbour (91.8%). Using the smallest set of three genes --Il12b, Cd40 and Socs3-- the performance remained satisfactory and the best accuracy was with Support Vector Machine (95.9%). These data mining models, using data for the genes Il12b, Cd40 and Socs3, were validated with a human data set consisting of 27 samples. Support Vector Machines (71.4%) and Nearest Neighbour (92.6%) gave the worst performances, but the remaining models correctly classified all the 27 samples. CONCLUSIONS: The genes selected by the data mining protocol proposed were shown to be informative for discriminating between inflammatory and steady-state phenotypes in dendritic cells. The robustness of the data mining protocol was confirmed by the accuracy for a human data set, when using only the following three genes: Il12b, Cd40 and Socs3. In summary, we analysed the longitudinal pattern of expression in dendritic cells stimulated with activating agents with the aim of identifying signatures that would predict or explain the dentritic cell response to an inflammatory agent. BioMed Central 2011-08-29 /pmc/articles/PMC3179938/ /pubmed/21875438 http://dx.doi.org/10.1186/1471-2172-12-50 Text en Copyright ©2011 Tuana et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Tuana, Giacomo
Volpato, Viola
Ricciardi-Castagnoli, Paola
Zolezzi, Francesca
Stella, Fabio
Foti, Maria
Classification of dendritic cell phenotypes from gene expression data
title Classification of dendritic cell phenotypes from gene expression data
title_full Classification of dendritic cell phenotypes from gene expression data
title_fullStr Classification of dendritic cell phenotypes from gene expression data
title_full_unstemmed Classification of dendritic cell phenotypes from gene expression data
title_short Classification of dendritic cell phenotypes from gene expression data
title_sort classification of dendritic cell phenotypes from gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179938/
https://www.ncbi.nlm.nih.gov/pubmed/21875438
http://dx.doi.org/10.1186/1471-2172-12-50
work_keys_str_mv AT tuanagiacomo classificationofdendriticcellphenotypesfromgeneexpressiondata
AT volpatoviola classificationofdendriticcellphenotypesfromgeneexpressiondata
AT ricciardicastagnolipaola classificationofdendriticcellphenotypesfromgeneexpressiondata
AT zolezzifrancesca classificationofdendriticcellphenotypesfromgeneexpressiondata
AT stellafabio classificationofdendriticcellphenotypesfromgeneexpressiondata
AT fotimaria classificationofdendriticcellphenotypesfromgeneexpressiondata