Cargando…

Comparison of Predicted Probabilities of Proportional Hazards Regression and Linear Discriminant Analysis Methods Using a Colorectal Cancer Molecular Biomarker Database

BACKGROUND: Although a majority of studies in cancer biomarker discovery claim to use proportional hazards regression (PHREG) to the study the ability of a biomarker to predict survival, few studies use the predicted probabilities obtained from the model to test the quality of the model. In this pap...

Descripción completa

Detalles Bibliográficos
Autores principales: Meleth, Sreelatha, Chatla, Chakrapani, Katkoori, Venkat R., Anderson, Billie, Hardin, James M., Jhala, Nirag C., Bartolucci, Al, Grizzle, William E., Manne, Upender
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675853/
https://www.ncbi.nlm.nih.gov/pubmed/19455238
_version_ 1782166721288732672
author Meleth, Sreelatha
Chatla, Chakrapani
Katkoori, Venkat R.
Anderson, Billie
Hardin, James M.
Jhala, Nirag C.
Bartolucci, Al
Grizzle, William E.
Manne, Upender
author_facet Meleth, Sreelatha
Chatla, Chakrapani
Katkoori, Venkat R.
Anderson, Billie
Hardin, James M.
Jhala, Nirag C.
Bartolucci, Al
Grizzle, William E.
Manne, Upender
author_sort Meleth, Sreelatha
collection PubMed
description BACKGROUND: Although a majority of studies in cancer biomarker discovery claim to use proportional hazards regression (PHREG) to the study the ability of a biomarker to predict survival, few studies use the predicted probabilities obtained from the model to test the quality of the model. In this paper, we compared the quality of predictions by a PHREG model to that of a linear discriminant analysis (LDA) in both training and test set settings. METHODS: The PHREG and LDA models were built on a 491 colorectal cancer (CRC) patient dataset comprised of demographic and clinicopathologic variables, and phenotypic expression of p53 and Bcl-2. Two variable selection methods, stepwise discriminant analysis and the backward selection, were used to identify the final models. The endpoint of prediction in these models was five-year post-surgery survival. We also used linear regression model to examine the effect of bin size in the training set on the accuracy of prediction in the test set. RESULTS: The two variable selection techniques resulted in different models when stage was included in the list of variables available for selection. However, the proportion of survivors and non-survivors correctly identified was identical in both of these models. When stage was excluded from the variable list, the error rate for the LDA model was 42% as compared to an error rate of 34% for the PHREG model. CONCLUSIONS: This study suggests that a PHREG model can perform as well or better than a traditional classifier such as LDA to classify patients into prognostic classes. Also, this study suggests that in the absence of the tumor stage as a variable, Bcl-2 expression is a strong prognostic molecular marker of CRC.
format Text
id pubmed-2675853
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26758532009-05-19 Comparison of Predicted Probabilities of Proportional Hazards Regression and Linear Discriminant Analysis Methods Using a Colorectal Cancer Molecular Biomarker Database Meleth, Sreelatha Chatla, Chakrapani Katkoori, Venkat R. Anderson, Billie Hardin, James M. Jhala, Nirag C. Bartolucci, Al Grizzle, William E. Manne, Upender Cancer Inform Original Research BACKGROUND: Although a majority of studies in cancer biomarker discovery claim to use proportional hazards regression (PHREG) to the study the ability of a biomarker to predict survival, few studies use the predicted probabilities obtained from the model to test the quality of the model. In this paper, we compared the quality of predictions by a PHREG model to that of a linear discriminant analysis (LDA) in both training and test set settings. METHODS: The PHREG and LDA models were built on a 491 colorectal cancer (CRC) patient dataset comprised of demographic and clinicopathologic variables, and phenotypic expression of p53 and Bcl-2. Two variable selection methods, stepwise discriminant analysis and the backward selection, were used to identify the final models. The endpoint of prediction in these models was five-year post-surgery survival. We also used linear regression model to examine the effect of bin size in the training set on the accuracy of prediction in the test set. RESULTS: The two variable selection techniques resulted in different models when stage was included in the list of variables available for selection. However, the proportion of survivors and non-survivors correctly identified was identical in both of these models. When stage was excluded from the variable list, the error rate for the LDA model was 42% as compared to an error rate of 34% for the PHREG model. CONCLUSIONS: This study suggests that a PHREG model can perform as well or better than a traditional classifier such as LDA to classify patients into prognostic classes. Also, this study suggests that in the absence of the tumor stage as a variable, Bcl-2 expression is a strong prognostic molecular marker of CRC. Libertas Academica 2007-03-02 /pmc/articles/PMC2675853/ /pubmed/19455238 Text en © 2007 The authors. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Original Research
Meleth, Sreelatha
Chatla, Chakrapani
Katkoori, Venkat R.
Anderson, Billie
Hardin, James M.
Jhala, Nirag C.
Bartolucci, Al
Grizzle, William E.
Manne, Upender
Comparison of Predicted Probabilities of Proportional Hazards Regression and Linear Discriminant Analysis Methods Using a Colorectal Cancer Molecular Biomarker Database
title Comparison of Predicted Probabilities of Proportional Hazards Regression and Linear Discriminant Analysis Methods Using a Colorectal Cancer Molecular Biomarker Database
title_full Comparison of Predicted Probabilities of Proportional Hazards Regression and Linear Discriminant Analysis Methods Using a Colorectal Cancer Molecular Biomarker Database
title_fullStr Comparison of Predicted Probabilities of Proportional Hazards Regression and Linear Discriminant Analysis Methods Using a Colorectal Cancer Molecular Biomarker Database
title_full_unstemmed Comparison of Predicted Probabilities of Proportional Hazards Regression and Linear Discriminant Analysis Methods Using a Colorectal Cancer Molecular Biomarker Database
title_short Comparison of Predicted Probabilities of Proportional Hazards Regression and Linear Discriminant Analysis Methods Using a Colorectal Cancer Molecular Biomarker Database
title_sort comparison of predicted probabilities of proportional hazards regression and linear discriminant analysis methods using a colorectal cancer molecular biomarker database
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675853/
https://www.ncbi.nlm.nih.gov/pubmed/19455238
work_keys_str_mv AT melethsreelatha comparisonofpredictedprobabilitiesofproportionalhazardsregressionandlineardiscriminantanalysismethodsusingacolorectalcancermolecularbiomarkerdatabase
AT chatlachakrapani comparisonofpredictedprobabilitiesofproportionalhazardsregressionandlineardiscriminantanalysismethodsusingacolorectalcancermolecularbiomarkerdatabase
AT katkoorivenkatr comparisonofpredictedprobabilitiesofproportionalhazardsregressionandlineardiscriminantanalysismethodsusingacolorectalcancermolecularbiomarkerdatabase
AT andersonbillie comparisonofpredictedprobabilitiesofproportionalhazardsregressionandlineardiscriminantanalysismethodsusingacolorectalcancermolecularbiomarkerdatabase
AT hardinjamesm comparisonofpredictedprobabilitiesofproportionalhazardsregressionandlineardiscriminantanalysismethodsusingacolorectalcancermolecularbiomarkerdatabase
AT jhalaniragc comparisonofpredictedprobabilitiesofproportionalhazardsregressionandlineardiscriminantanalysismethodsusingacolorectalcancermolecularbiomarkerdatabase
AT bartoluccial comparisonofpredictedprobabilitiesofproportionalhazardsregressionandlineardiscriminantanalysismethodsusingacolorectalcancermolecularbiomarkerdatabase
AT grizzlewilliame comparisonofpredictedprobabilitiesofproportionalhazardsregressionandlineardiscriminantanalysismethodsusingacolorectalcancermolecularbiomarkerdatabase
AT manneupender comparisonofpredictedprobabilitiesofproportionalhazardsregressionandlineardiscriminantanalysismethodsusingacolorectalcancermolecularbiomarkerdatabase