Cargando…
Classification of dendritic cell phenotypes from gene expression data
BACKGROUND: The selection of relevant genes for sample classification is a common task in many gene expression studies. Although a number of tools have been developed to identify optimal gene expression signatures, they often generate gene lists that are too long to be exploited clinically. Conseque...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179938/ https://www.ncbi.nlm.nih.gov/pubmed/21875438 http://dx.doi.org/10.1186/1471-2172-12-50 |
_version_ | 1782212571705638912 |
---|---|
author | Tuana, Giacomo Volpato, Viola Ricciardi-Castagnoli, Paola Zolezzi, Francesca Stella, Fabio Foti, Maria |
author_facet | Tuana, Giacomo Volpato, Viola Ricciardi-Castagnoli, Paola Zolezzi, Francesca Stella, Fabio Foti, Maria |
author_sort | Tuana, Giacomo |
collection | PubMed |
description | BACKGROUND: The selection of relevant genes for sample classification is a common task in many gene expression studies. Although a number of tools have been developed to identify optimal gene expression signatures, they often generate gene lists that are too long to be exploited clinically. Consequently, researchers in the field try to identify the smallest set of genes that provide good sample classification. We investigated the genome-wide expression of the inflammatory phenotype in dendritic cells. Dendritic cells are a complex group of cells that play a critical role in vertebrate immunity. Therefore, the prediction of the inflammatory phenotype in these cells may help with the selection of immune-modulating compounds. RESULTS: A data mining protocol was applied to microarray data for murine cell lines treated with various inflammatory stimuli. The learning and validation data sets consisted of 155 and 49 samples, respectively. The data mining protocol reduced the number of probe sets from 5,802 to 10, then from 10 to 6 and finally from 6 to 3. The performances of a set of supervised classification models were compared. The best accuracy, when using the six following genes --Il12b, Cd40, Socs3, Irgm1, Plin2 and Lgals3bp-- was obtained by Tree Augmented Naïve Bayes and Nearest Neighbour (91.8%). Using the smallest set of three genes --Il12b, Cd40 and Socs3-- the performance remained satisfactory and the best accuracy was with Support Vector Machine (95.9%). These data mining models, using data for the genes Il12b, Cd40 and Socs3, were validated with a human data set consisting of 27 samples. Support Vector Machines (71.4%) and Nearest Neighbour (92.6%) gave the worst performances, but the remaining models correctly classified all the 27 samples. CONCLUSIONS: The genes selected by the data mining protocol proposed were shown to be informative for discriminating between inflammatory and steady-state phenotypes in dendritic cells. The robustness of the data mining protocol was confirmed by the accuracy for a human data set, when using only the following three genes: Il12b, Cd40 and Socs3. In summary, we analysed the longitudinal pattern of expression in dendritic cells stimulated with activating agents with the aim of identifying signatures that would predict or explain the dentritic cell response to an inflammatory agent. |
format | Online Article Text |
id | pubmed-3179938 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31799382011-09-26 Classification of dendritic cell phenotypes from gene expression data Tuana, Giacomo Volpato, Viola Ricciardi-Castagnoli, Paola Zolezzi, Francesca Stella, Fabio Foti, Maria BMC Immunol Research Article BACKGROUND: The selection of relevant genes for sample classification is a common task in many gene expression studies. Although a number of tools have been developed to identify optimal gene expression signatures, they often generate gene lists that are too long to be exploited clinically. Consequently, researchers in the field try to identify the smallest set of genes that provide good sample classification. We investigated the genome-wide expression of the inflammatory phenotype in dendritic cells. Dendritic cells are a complex group of cells that play a critical role in vertebrate immunity. Therefore, the prediction of the inflammatory phenotype in these cells may help with the selection of immune-modulating compounds. RESULTS: A data mining protocol was applied to microarray data for murine cell lines treated with various inflammatory stimuli. The learning and validation data sets consisted of 155 and 49 samples, respectively. The data mining protocol reduced the number of probe sets from 5,802 to 10, then from 10 to 6 and finally from 6 to 3. The performances of a set of supervised classification models were compared. The best accuracy, when using the six following genes --Il12b, Cd40, Socs3, Irgm1, Plin2 and Lgals3bp-- was obtained by Tree Augmented Naïve Bayes and Nearest Neighbour (91.8%). Using the smallest set of three genes --Il12b, Cd40 and Socs3-- the performance remained satisfactory and the best accuracy was with Support Vector Machine (95.9%). These data mining models, using data for the genes Il12b, Cd40 and Socs3, were validated with a human data set consisting of 27 samples. Support Vector Machines (71.4%) and Nearest Neighbour (92.6%) gave the worst performances, but the remaining models correctly classified all the 27 samples. CONCLUSIONS: The genes selected by the data mining protocol proposed were shown to be informative for discriminating between inflammatory and steady-state phenotypes in dendritic cells. The robustness of the data mining protocol was confirmed by the accuracy for a human data set, when using only the following three genes: Il12b, Cd40 and Socs3. In summary, we analysed the longitudinal pattern of expression in dendritic cells stimulated with activating agents with the aim of identifying signatures that would predict or explain the dentritic cell response to an inflammatory agent. BioMed Central 2011-08-29 /pmc/articles/PMC3179938/ /pubmed/21875438 http://dx.doi.org/10.1186/1471-2172-12-50 Text en Copyright ©2011 Tuana et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Tuana, Giacomo Volpato, Viola Ricciardi-Castagnoli, Paola Zolezzi, Francesca Stella, Fabio Foti, Maria Classification of dendritic cell phenotypes from gene expression data |
title | Classification of dendritic cell phenotypes from gene expression data |
title_full | Classification of dendritic cell phenotypes from gene expression data |
title_fullStr | Classification of dendritic cell phenotypes from gene expression data |
title_full_unstemmed | Classification of dendritic cell phenotypes from gene expression data |
title_short | Classification of dendritic cell phenotypes from gene expression data |
title_sort | classification of dendritic cell phenotypes from gene expression data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179938/ https://www.ncbi.nlm.nih.gov/pubmed/21875438 http://dx.doi.org/10.1186/1471-2172-12-50 |
work_keys_str_mv | AT tuanagiacomo classificationofdendriticcellphenotypesfromgeneexpressiondata AT volpatoviola classificationofdendriticcellphenotypesfromgeneexpressiondata AT ricciardicastagnolipaola classificationofdendriticcellphenotypesfromgeneexpressiondata AT zolezzifrancesca classificationofdendriticcellphenotypesfromgeneexpressiondata AT stellafabio classificationofdendriticcellphenotypesfromgeneexpressiondata AT fotimaria classificationofdendriticcellphenotypesfromgeneexpressiondata |