Cargando…

Use of Attribute Driven Incremental Discretization and Logic Learning Machine to build a prognostic classifier for neuroblastoma patients

BACKGROUND: Cancer patient's outcome is written, in part, in the gene expression profile of the tumor. We previously identified a 62-probe sets signature (NB-hypo) to identify tissue hypoxia in neuroblastoma tumors and showed that NB-hypo stratified neuroblastoma patients in good and poor outco...

Descripción completa

Detalles Bibliográficos
Autores principales: Cangelosi, Davide, Muselli, Marco, Parodi, Stefano, Blengio, Fabiola, Becherini, Pamela, Versteeg, Rogier, Conte, Massimo, Varesio, Luigi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4095004/
https://www.ncbi.nlm.nih.gov/pubmed/25078098
http://dx.doi.org/10.1186/1471-2105-15-S5-S4
_version_ 1782325936830545920
author Cangelosi, Davide
Muselli, Marco
Parodi, Stefano
Blengio, Fabiola
Becherini, Pamela
Versteeg, Rogier
Conte, Massimo
Varesio, Luigi
author_facet Cangelosi, Davide
Muselli, Marco
Parodi, Stefano
Blengio, Fabiola
Becherini, Pamela
Versteeg, Rogier
Conte, Massimo
Varesio, Luigi
author_sort Cangelosi, Davide
collection PubMed
description BACKGROUND: Cancer patient's outcome is written, in part, in the gene expression profile of the tumor. We previously identified a 62-probe sets signature (NB-hypo) to identify tissue hypoxia in neuroblastoma tumors and showed that NB-hypo stratified neuroblastoma patients in good and poor outcome [1]. It was important to develop a prognostic classifier to cluster patients into risk groups benefiting of defined therapeutic approaches. Novel classification and data discretization approaches can be instrumental for the generation of accurate predictors and robust tools for clinical decision support. We explored the application to gene expression data of Rulex, a novel software suite including the Attribute Driven Incremental Discretization technique for transforming continuous variables into simplified discrete ones and the Logic Learning Machine model for intelligible rule generation. RESULTS: We applied Rulex components to the problem of predicting the outcome of neuroblastoma patients on the bases of 62 probe sets NB-hypo gene expression signature. The resulting classifier consisted in 9 rules utilizing mainly two conditions of the relative expression of 11 probe sets. These rules were very effective predictors, as shown in an independent validation set, demonstrating the validity of the LLM algorithm applied to microarray data and patients' classification. The LLM performed as efficiently as Prediction Analysis of Microarray and Support Vector Machine, and outperformed other learning algorithms such as C4.5. Rulex carried out a feature selection by selecting a new signature (NB-hypo-II) of 11 probe sets that turned out to be the most relevant in predicting outcome among the 62 of the NB-hypo signature. Rules are easily interpretable as they involve only few conditions. Furthermore, we demonstrate that the application of a weighted classification associated with the rules improves the classification of poorly represented classes. CONCLUSIONS: Our findings provided evidence that the application of Rulex to the expression values of NB-hypo signature created a set of accurate, high quality, consistent and interpretable rules for the prediction of neuroblastoma patients' outcome. We identified the Rulex weighted classification as a flexible tool that can support clinical decisions. For these reasons, we consider Rulex to be a useful tool for cancer classification from microarray gene expression data.
format Online
Article
Text
id pubmed-4095004
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40950042014-07-23 Use of Attribute Driven Incremental Discretization and Logic Learning Machine to build a prognostic classifier for neuroblastoma patients Cangelosi, Davide Muselli, Marco Parodi, Stefano Blengio, Fabiola Becherini, Pamela Versteeg, Rogier Conte, Massimo Varesio, Luigi BMC Bioinformatics Research BACKGROUND: Cancer patient's outcome is written, in part, in the gene expression profile of the tumor. We previously identified a 62-probe sets signature (NB-hypo) to identify tissue hypoxia in neuroblastoma tumors and showed that NB-hypo stratified neuroblastoma patients in good and poor outcome [1]. It was important to develop a prognostic classifier to cluster patients into risk groups benefiting of defined therapeutic approaches. Novel classification and data discretization approaches can be instrumental for the generation of accurate predictors and robust tools for clinical decision support. We explored the application to gene expression data of Rulex, a novel software suite including the Attribute Driven Incremental Discretization technique for transforming continuous variables into simplified discrete ones and the Logic Learning Machine model for intelligible rule generation. RESULTS: We applied Rulex components to the problem of predicting the outcome of neuroblastoma patients on the bases of 62 probe sets NB-hypo gene expression signature. The resulting classifier consisted in 9 rules utilizing mainly two conditions of the relative expression of 11 probe sets. These rules were very effective predictors, as shown in an independent validation set, demonstrating the validity of the LLM algorithm applied to microarray data and patients' classification. The LLM performed as efficiently as Prediction Analysis of Microarray and Support Vector Machine, and outperformed other learning algorithms such as C4.5. Rulex carried out a feature selection by selecting a new signature (NB-hypo-II) of 11 probe sets that turned out to be the most relevant in predicting outcome among the 62 of the NB-hypo signature. Rules are easily interpretable as they involve only few conditions. Furthermore, we demonstrate that the application of a weighted classification associated with the rules improves the classification of poorly represented classes. CONCLUSIONS: Our findings provided evidence that the application of Rulex to the expression values of NB-hypo signature created a set of accurate, high quality, consistent and interpretable rules for the prediction of neuroblastoma patients' outcome. We identified the Rulex weighted classification as a flexible tool that can support clinical decisions. For these reasons, we consider Rulex to be a useful tool for cancer classification from microarray gene expression data. BioMed Central 2014-05-06 /pmc/articles/PMC4095004/ /pubmed/25078098 http://dx.doi.org/10.1186/1471-2105-15-S5-S4 Text en Copyright © 2014 Cangelosi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Cangelosi, Davide
Muselli, Marco
Parodi, Stefano
Blengio, Fabiola
Becherini, Pamela
Versteeg, Rogier
Conte, Massimo
Varesio, Luigi
Use of Attribute Driven Incremental Discretization and Logic Learning Machine to build a prognostic classifier for neuroblastoma patients
title Use of Attribute Driven Incremental Discretization and Logic Learning Machine to build a prognostic classifier for neuroblastoma patients
title_full Use of Attribute Driven Incremental Discretization and Logic Learning Machine to build a prognostic classifier for neuroblastoma patients
title_fullStr Use of Attribute Driven Incremental Discretization and Logic Learning Machine to build a prognostic classifier for neuroblastoma patients
title_full_unstemmed Use of Attribute Driven Incremental Discretization and Logic Learning Machine to build a prognostic classifier for neuroblastoma patients
title_short Use of Attribute Driven Incremental Discretization and Logic Learning Machine to build a prognostic classifier for neuroblastoma patients
title_sort use of attribute driven incremental discretization and logic learning machine to build a prognostic classifier for neuroblastoma patients
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4095004/
https://www.ncbi.nlm.nih.gov/pubmed/25078098
http://dx.doi.org/10.1186/1471-2105-15-S5-S4
work_keys_str_mv AT cangelosidavide useofattributedrivenincrementaldiscretizationandlogiclearningmachinetobuildaprognosticclassifierforneuroblastomapatients
AT musellimarco useofattributedrivenincrementaldiscretizationandlogiclearningmachinetobuildaprognosticclassifierforneuroblastomapatients
AT parodistefano useofattributedrivenincrementaldiscretizationandlogiclearningmachinetobuildaprognosticclassifierforneuroblastomapatients
AT blengiofabiola useofattributedrivenincrementaldiscretizationandlogiclearningmachinetobuildaprognosticclassifierforneuroblastomapatients
AT becherinipamela useofattributedrivenincrementaldiscretizationandlogiclearningmachinetobuildaprognosticclassifierforneuroblastomapatients
AT versteegrogier useofattributedrivenincrementaldiscretizationandlogiclearningmachinetobuildaprognosticclassifierforneuroblastomapatients
AT contemassimo useofattributedrivenincrementaldiscretizationandlogiclearningmachinetobuildaprognosticclassifierforneuroblastomapatients
AT varesioluigi useofattributedrivenincrementaldiscretizationandlogiclearningmachinetobuildaprognosticclassifierforneuroblastomapatients