Cargando…

Current composite-feature classification methods do not outperform simple single-genes classifiers in breast cancer prognosis

Integrating gene expression data with secondary data such as pathway or protein-protein interaction data has been proposed as a promising approach for improved outcome prediction of cancer patients. Methods employing this approach usually aggregate the expression of genes into new composite features...

Descripción completa

Detalles Bibliográficos
Autores principales: Staiger, Christine, Cadot, Sidney, Györffy, Balázs, Wessels, Lodewyk F. A., Klau, Gunnar W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3870302/
https://www.ncbi.nlm.nih.gov/pubmed/24391662
http://dx.doi.org/10.3389/fgene.2013.00289
_version_ 1782296690338824192
author Staiger, Christine
Cadot, Sidney
Györffy, Balázs
Wessels, Lodewyk F. A.
Klau, Gunnar W.
author_facet Staiger, Christine
Cadot, Sidney
Györffy, Balázs
Wessels, Lodewyk F. A.
Klau, Gunnar W.
author_sort Staiger, Christine
collection PubMed
description Integrating gene expression data with secondary data such as pathway or protein-protein interaction data has been proposed as a promising approach for improved outcome prediction of cancer patients. Methods employing this approach usually aggregate the expression of genes into new composite features, while the secondary data guide this aggregation. Previous studies were limited to few data sets with a small number of patients. Moreover, each study used different data and evaluation procedures. This makes it difficult to objectively assess the gain in classification performance. Here we introduce the Amsterdam Classification Evaluation Suite (ACES). ACES is a Python package to objectively evaluate classification and feature-selection methods and contains methods for pooling and normalizing Affymetrix microarrays from different studies. It is simple to use and therefore facilitates the comparison of new approaches to best-in-class approaches. In addition to the methods described in our earlier study (Staiger et al., 2012), we have included two prominent prognostic gene signatures specific for breast cancer outcome, one more composite feature selection method and two network-based gene ranking methods. Employing the evaluation pipeline we show that current composite-feature classification methods do not outperform simple single-genes classifiers in predicting outcome in breast cancer. Furthermore, we find that also the stability of features across different data sets is not higher for composite features. Most stunningly, we observe that prediction performances are not affected when extracting features from randomized PPI networks.
format Online
Article
Text
id pubmed-3870302
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-38703022014-01-03 Current composite-feature classification methods do not outperform simple single-genes classifiers in breast cancer prognosis Staiger, Christine Cadot, Sidney Györffy, Balázs Wessels, Lodewyk F. A. Klau, Gunnar W. Front Genet Genetics Integrating gene expression data with secondary data such as pathway or protein-protein interaction data has been proposed as a promising approach for improved outcome prediction of cancer patients. Methods employing this approach usually aggregate the expression of genes into new composite features, while the secondary data guide this aggregation. Previous studies were limited to few data sets with a small number of patients. Moreover, each study used different data and evaluation procedures. This makes it difficult to objectively assess the gain in classification performance. Here we introduce the Amsterdam Classification Evaluation Suite (ACES). ACES is a Python package to objectively evaluate classification and feature-selection methods and contains methods for pooling and normalizing Affymetrix microarrays from different studies. It is simple to use and therefore facilitates the comparison of new approaches to best-in-class approaches. In addition to the methods described in our earlier study (Staiger et al., 2012), we have included two prominent prognostic gene signatures specific for breast cancer outcome, one more composite feature selection method and two network-based gene ranking methods. Employing the evaluation pipeline we show that current composite-feature classification methods do not outperform simple single-genes classifiers in predicting outcome in breast cancer. Furthermore, we find that also the stability of features across different data sets is not higher for composite features. Most stunningly, we observe that prediction performances are not affected when extracting features from randomized PPI networks. Frontiers Media S.A. 2013-12-23 /pmc/articles/PMC3870302/ /pubmed/24391662 http://dx.doi.org/10.3389/fgene.2013.00289 Text en Copyright © 2013 Staiger, Cadot, Györffy, Wessels and Klau. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Staiger, Christine
Cadot, Sidney
Györffy, Balázs
Wessels, Lodewyk F. A.
Klau, Gunnar W.
Current composite-feature classification methods do not outperform simple single-genes classifiers in breast cancer prognosis
title Current composite-feature classification methods do not outperform simple single-genes classifiers in breast cancer prognosis
title_full Current composite-feature classification methods do not outperform simple single-genes classifiers in breast cancer prognosis
title_fullStr Current composite-feature classification methods do not outperform simple single-genes classifiers in breast cancer prognosis
title_full_unstemmed Current composite-feature classification methods do not outperform simple single-genes classifiers in breast cancer prognosis
title_short Current composite-feature classification methods do not outperform simple single-genes classifiers in breast cancer prognosis
title_sort current composite-feature classification methods do not outperform simple single-genes classifiers in breast cancer prognosis
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3870302/
https://www.ncbi.nlm.nih.gov/pubmed/24391662
http://dx.doi.org/10.3389/fgene.2013.00289
work_keys_str_mv AT staigerchristine currentcompositefeatureclassificationmethodsdonotoutperformsimplesinglegenesclassifiersinbreastcancerprognosis
AT cadotsidney currentcompositefeatureclassificationmethodsdonotoutperformsimplesinglegenesclassifiersinbreastcancerprognosis
AT gyorffybalazs currentcompositefeatureclassificationmethodsdonotoutperformsimplesinglegenesclassifiersinbreastcancerprognosis
AT wesselslodewykfa currentcompositefeatureclassificationmethodsdonotoutperformsimplesinglegenesclassifiersinbreastcancerprognosis
AT klaugunnarw currentcompositefeatureclassificationmethodsdonotoutperformsimplesinglegenesclassifiersinbreastcancerprognosis