Cargando…
Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve
BACKGROUND: The linear combination of variables is an attractive method in many medical analyses targeting a score to classify patients. In the case of ROC curves the most popular problem is to identify the linear combination which maximizes area under curve (AUC). This problem is complete closed wh...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4099021/ https://www.ncbi.nlm.nih.gov/pubmed/25068036 http://dx.doi.org/10.1186/2043-9113-4-10 |
_version_ | 1782326418731958272 |
---|---|
author | Todor, Nicolae Todor, Irina Săplăcan, Gavril |
author_facet | Todor, Nicolae Todor, Irina Săplăcan, Gavril |
author_sort | Todor, Nicolae |
collection | PubMed |
description | BACKGROUND: The linear combination of variables is an attractive method in many medical analyses targeting a score to classify patients. In the case of ROC curves the most popular problem is to identify the linear combination which maximizes area under curve (AUC). This problem is complete closed when normality assumptions are met. With no assumption of normality search algorithm are avoided because it is accepted that we have to evaluate AUC n(d) times where n is the number of distinct observation and d is the number of variables. METHODS: For d = 2, using particularities of AUC formula, we described an algorithm which lowered the number of evaluations of AUC from n(2) to n(n-1) + 1. For d > 2 our proposed solution is an approximate method by considering equidistant points on the unit sphere in R(d) where we evaluate AUC. RESULTS: The algorithms were applied to data from our lab to predict response of treatment by a set of molecular markers in cervical cancers patients. In order to evaluate the strength of our algorithms a simulation was added. CONCLUSIONS: In the case of no normality presented algorithms are feasible. For many variables computation time could be increased but acceptable. |
format | Online Article Text |
id | pubmed-4099021 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40990212014-07-25 Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve Todor, Nicolae Todor, Irina Săplăcan, Gavril J Clin Bioinforma Research BACKGROUND: The linear combination of variables is an attractive method in many medical analyses targeting a score to classify patients. In the case of ROC curves the most popular problem is to identify the linear combination which maximizes area under curve (AUC). This problem is complete closed when normality assumptions are met. With no assumption of normality search algorithm are avoided because it is accepted that we have to evaluate AUC n(d) times where n is the number of distinct observation and d is the number of variables. METHODS: For d = 2, using particularities of AUC formula, we described an algorithm which lowered the number of evaluations of AUC from n(2) to n(n-1) + 1. For d > 2 our proposed solution is an approximate method by considering equidistant points on the unit sphere in R(d) where we evaluate AUC. RESULTS: The algorithms were applied to data from our lab to predict response of treatment by a set of molecular markers in cervical cancers patients. In order to evaluate the strength of our algorithms a simulation was added. CONCLUSIONS: In the case of no normality presented algorithms are feasible. For many variables computation time could be increased but acceptable. BioMed Central 2014-07-04 /pmc/articles/PMC4099021/ /pubmed/25068036 http://dx.doi.org/10.1186/2043-9113-4-10 Text en Copyright © 2014 Todor et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Todor, Nicolae Todor, Irina Săplăcan, Gavril Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve |
title | Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve |
title_full | Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve |
title_fullStr | Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve |
title_full_unstemmed | Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve |
title_short | Tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve |
title_sort | tools to identify linear combination of prognostic factors which maximizes area under receiver operator curve |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4099021/ https://www.ncbi.nlm.nih.gov/pubmed/25068036 http://dx.doi.org/10.1186/2043-9113-4-10 |
work_keys_str_mv | AT todornicolae toolstoidentifylinearcombinationofprognosticfactorswhichmaximizesareaunderreceiveroperatorcurve AT todoririna toolstoidentifylinearcombinationofprognosticfactorswhichmaximizesareaunderreceiveroperatorcurve AT saplacangavril toolstoidentifylinearcombinationofprognosticfactorswhichmaximizesareaunderreceiveroperatorcurve |