Cargando…

A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?

Motivation: Survival prediction of breast cancer (BC) patients independently of treatment, also known as prognostication, is a complex task since clinically similar breast tumors, in addition to be molecularly heterogeneous, may exhibit different clinical outcomes. In recent years, the analysis of g...

Descripción completa

Detalles Bibliográficos
Autores principales: Haibe-Kains, B., Desmedt, C., Sotiriou, C., Bontempi, G.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553442/
https://www.ncbi.nlm.nih.gov/pubmed/18635567
http://dx.doi.org/10.1093/bioinformatics/btn374
_version_ 1782159507706609664
author Haibe-Kains, B.
Desmedt, C.
Sotiriou, C.
Bontempi, G.
author_facet Haibe-Kains, B.
Desmedt, C.
Sotiriou, C.
Bontempi, G.
author_sort Haibe-Kains, B.
collection PubMed
description Motivation: Survival prediction of breast cancer (BC) patients independently of treatment, also known as prognostication, is a complex task since clinically similar breast tumors, in addition to be molecularly heterogeneous, may exhibit different clinical outcomes. In recent years, the analysis of gene expression profiles by means of sophisticated data mining tools emerged as a promising technology to bring additional insights into BC biology and to improve the quality of prognostication. The aim of this work is to assess quantitatively the accuracy of prediction obtained with state-of-the-art data analysis techniques for BC microarray data through an independent and thorough framework. Results: Due to the large number of variables, the reduced amount of samples and the high degree of noise, complex prediction methods are highly exposed to performance degradation despite the use of cross-validation techniques. Our analysis shows that the most complex methods are not significantly better than the simplest one, a univariate model relying on a single proliferation gene. This result suggests that proliferation might be the most relevant biological process for BC prognostication and that the loss of interpretability deriving from the use of overcomplex methods may be not sufficiently counterbalanced by an improvement of the quality of prediction. Availability: The comparison study is implemented in an R package called survcomp and is available from http://www.ulb.ac.be/di/map/bhaibeka/software/survcomp/. Contact: bhaibeka@ulb.ac.be Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2553442
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-25534422009-02-25 A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all? Haibe-Kains, B. Desmedt, C. Sotiriou, C. Bontempi, G. Bioinformatics Original Papers Motivation: Survival prediction of breast cancer (BC) patients independently of treatment, also known as prognostication, is a complex task since clinically similar breast tumors, in addition to be molecularly heterogeneous, may exhibit different clinical outcomes. In recent years, the analysis of gene expression profiles by means of sophisticated data mining tools emerged as a promising technology to bring additional insights into BC biology and to improve the quality of prognostication. The aim of this work is to assess quantitatively the accuracy of prediction obtained with state-of-the-art data analysis techniques for BC microarray data through an independent and thorough framework. Results: Due to the large number of variables, the reduced amount of samples and the high degree of noise, complex prediction methods are highly exposed to performance degradation despite the use of cross-validation techniques. Our analysis shows that the most complex methods are not significantly better than the simplest one, a univariate model relying on a single proliferation gene. This result suggests that proliferation might be the most relevant biological process for BC prognostication and that the loss of interpretability deriving from the use of overcomplex methods may be not sufficiently counterbalanced by an improvement of the quality of prediction. Availability: The comparison study is implemented in an R package called survcomp and is available from http://www.ulb.ac.be/di/map/bhaibeka/software/survcomp/. Contact: bhaibeka@ulb.ac.be Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2008-10-01 2008-07-17 /pmc/articles/PMC2553442/ /pubmed/18635567 http://dx.doi.org/10.1093/bioinformatics/btn374 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Haibe-Kains, B.
Desmedt, C.
Sotiriou, C.
Bontempi, G.
A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?
title A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?
title_full A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?
title_fullStr A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?
title_full_unstemmed A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?
title_short A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?
title_sort comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all?
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553442/
https://www.ncbi.nlm.nih.gov/pubmed/18635567
http://dx.doi.org/10.1093/bioinformatics/btn374
work_keys_str_mv AT haibekainsb acomparativestudyofsurvivalmodelsforbreastcancerprognosticationbasedonmicroarraydatadoesasinglegenebeatthemall
AT desmedtc acomparativestudyofsurvivalmodelsforbreastcancerprognosticationbasedonmicroarraydatadoesasinglegenebeatthemall
AT sotiriouc acomparativestudyofsurvivalmodelsforbreastcancerprognosticationbasedonmicroarraydatadoesasinglegenebeatthemall
AT bontempig acomparativestudyofsurvivalmodelsforbreastcancerprognosticationbasedonmicroarraydatadoesasinglegenebeatthemall
AT haibekainsb comparativestudyofsurvivalmodelsforbreastcancerprognosticationbasedonmicroarraydatadoesasinglegenebeatthemall
AT desmedtc comparativestudyofsurvivalmodelsforbreastcancerprognosticationbasedonmicroarraydatadoesasinglegenebeatthemall
AT sotiriouc comparativestudyofsurvivalmodelsforbreastcancerprognosticationbasedonmicroarraydatadoesasinglegenebeatthemall
AT bontempig comparativestudyofsurvivalmodelsforbreastcancerprognosticationbasedonmicroarraydatadoesasinglegenebeatthemall