Cargando…

Doubly Optimized Calibrated Support Vector Machine (DOC-SVM): An Algorithm for Joint Optimization of Discrimination and Calibration

Historically, probabilistic models for decision support have focused on discrimination, e.g., minimizing the ranking error of predicted outcomes. Unfortunately, these models ignore another important aspect, calibration, which indicates the magnitude of correctness of model predictions. Using discrim...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Xiaoqian, Menon, Aditya, Wang, Shuang, Kim, Jihoon, Ohno-Machado, Lucila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3490990/
https://www.ncbi.nlm.nih.gov/pubmed/23139819
http://dx.doi.org/10.1371/journal.pone.0048823
Descripción
Sumario:Historically, probabilistic models for decision support have focused on discrimination, e.g., minimizing the ranking error of predicted outcomes. Unfortunately, these models ignore another important aspect, calibration, which indicates the magnitude of correctness of model predictions. Using discrimination and calibration simultaneously can be helpful for many clinical decisions. We investigated tradeoffs between these goals, and developed a unified maximum-margin method to handle them jointly. Our approach called, Doubly Optimized Calibrated Support Vector Machine (DOC-SVM), concurrently optimizes two loss functions: the ridge regression loss and the hinge loss. Experiments using three breast cancer gene-expression datasets (i.e., GSE2034, GSE2990, and Chanrion's datasets) showed that our model generated more calibrated outputs when compared to other state-of-the-art models like Support Vector Machine ([Image: see text] = 0.03, [Image: see text] = 0.13, and [Image: see text]<0.001) and Logistic Regression ([Image: see text] = 0.006, [Image: see text] = 0.008, and [Image: see text]<0.001). DOC-SVM also demonstrated better discrimination (i.e., higher AUCs) when compared to Support Vector Machine ([Image: see text] = 0.38, [Image: see text] = 0.29, and [Image: see text] = 0.047) and Logistic Regression ([Image: see text] = 0.38, [Image: see text] = 0.04, and [Image: see text]<0.0001). DOC-SVM produced a model that was better calibrated without sacrificing discrimination, and hence may be helpful in clinical decision making.