Cargando…

Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve

BACKGROUND: The linear combination of variables is an attractive method in many medical analyses targeting a score to classify patients. In the case of ROC curves the most popular problem is to identify the linear combination which maximizes area under curve (AUC). This problem is complete closed wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Todor, Nicolae, Todor, Irina, Săplăcan, Gavril
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4099021/
https://www.ncbi.nlm.nih.gov/pubmed/25068036
http://dx.doi.org/10.1186/2043-9113-4-10
_version_ 1782326418731958272
author Todor, Nicolae
Todor, Irina
Săplăcan, Gavril
author_facet Todor, Nicolae
Todor, Irina
Săplăcan, Gavril
author_sort Todor, Nicolae
collection PubMed
description BACKGROUND: The linear combination of variables is an attractive method in many medical analyses targeting a score to classify patients. In the case of ROC curves the most popular problem is to identify the linear combination which maximizes area under curve (AUC). This problem is complete closed when normality assumptions are met. With no assumption of normality search algorithm are avoided because it is accepted that we have to evaluate AUC n(d) times where n is the number of distinct observation and d is the number of variables. METHODS: For d = 2, using particularities of AUC formula, we described an algorithm which lowered the number of evaluations of AUC from n(2) to n(n-1) + 1. For d > 2 our proposed solution is an approximate method by considering equidistant points on the unit sphere in R(d) where we evaluate AUC. RESULTS: The algorithms were applied to data from our lab to predict response of treatment by a set of molecular markers in cervical cancers patients. In order to evaluate the strength of our algorithms a simulation was added. CONCLUSIONS: In the case of no normality presented algorithms are feasible. For many variables computation time could be increased but acceptable.
format Online
Article
Text
id pubmed-4099021
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40990212014-07-25 Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve Todor, Nicolae Todor, Irina Săplăcan, Gavril J Clin Bioinforma Research BACKGROUND: The linear combination of variables is an attractive method in many medical analyses targeting a score to classify patients. In the case of ROC curves the most popular problem is to identify the linear combination which maximizes area under curve (AUC). This problem is complete closed when normality assumptions are met. With no assumption of normality search algorithm are avoided because it is accepted that we have to evaluate AUC n(d) times where n is the number of distinct observation and d is the number of variables. METHODS: For d = 2, using particularities of AUC formula, we described an algorithm which lowered the number of evaluations of AUC from n(2) to n(n-1) + 1. For d > 2 our proposed solution is an approximate method by considering equidistant points on the unit sphere in R(d) where we evaluate AUC. RESULTS: The algorithms were applied to data from our lab to predict response of treatment by a set of molecular markers in cervical cancers patients. In order to evaluate the strength of our algorithms a simulation was added. CONCLUSIONS: In the case of no normality presented algorithms are feasible. For many variables computation time could be increased but acceptable. BioMed Central 2014-07-04 /pmc/articles/PMC4099021/ /pubmed/25068036 http://dx.doi.org/10.1186/2043-9113-4-10 Text en Copyright © 2014 Todor et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Todor, Nicolae
Todor, Irina
Săplăcan, Gavril
Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve
title Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve
title_full Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve
title_fullStr Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve
title_full_unstemmed Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve
title_short Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve
title_sort tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4099021/
https://www.ncbi.nlm.nih.gov/pubmed/25068036
http://dx.doi.org/10.1186/2043-9113-4-10
work_keys_str_mv AT todornicolae toolstoidentifylinearcombinationofprognosticfactorswhichmaximizesareaunderreceiveroperatorcurve
AT todoririna toolstoidentifylinearcombinationofprognosticfactorswhichmaximizesareaunderreceiveroperatorcurve
AT saplacangavril toolstoidentifylinearcombinationofprognosticfactorswhichmaximizesareaunderreceiveroperatorcurve