Cargando…

AUCReshaping: improved sensitivity at high-specificity

The evaluation of deep-learning (DL) systems typically relies on the Area under the Receiver-Operating-Curve (AU-ROC) as a performance metric. However, AU-ROC, in its holistic form, does not sufficiently consider performance within specific ranges of sensitivity and specificity, which are critical f...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhat, Sheethal, Mansoor, Awais, Georgescu, Bogdan, Panambur, Adarsh B., Ghesu, Florin C., Islam, Saahil, Packhäuser, Kai, Rodríguez-Salas, Dalia, Grbic, Sasa, Maier, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10689839/
https://www.ncbi.nlm.nih.gov/pubmed/38036602
http://dx.doi.org/10.1038/s41598-023-48482-x
_version_ 1785152433839669248
author Bhat, Sheethal
Mansoor, Awais
Georgescu, Bogdan
Panambur, Adarsh B.
Ghesu, Florin C.
Islam, Saahil
Packhäuser, Kai
Rodríguez-Salas, Dalia
Grbic, Sasa
Maier, Andreas
author_facet Bhat, Sheethal
Mansoor, Awais
Georgescu, Bogdan
Panambur, Adarsh B.
Ghesu, Florin C.
Islam, Saahil
Packhäuser, Kai
Rodríguez-Salas, Dalia
Grbic, Sasa
Maier, Andreas
author_sort Bhat, Sheethal
collection PubMed
description The evaluation of deep-learning (DL) systems typically relies on the Area under the Receiver-Operating-Curve (AU-ROC) as a performance metric. However, AU-ROC, in its holistic form, does not sufficiently consider performance within specific ranges of sensitivity and specificity, which are critical for the intended operational context of the system. Consequently, two systems with identical AU-ROC values can exhibit significantly divergent real-world performance. This issue is particularly pronounced in the context of anomaly detection tasks, a commonly employed application of DL systems across various research domains, including medical imaging, industrial automation, manufacturing, cyber security, fraud detection, and drug research, among others. The challenge arises from the heavy class imbalance in training datasets, with the abnormality class often incurring a considerably higher misclassification cost compared to the normal class. Traditional DL systems address this by adjusting the weighting of the cost function or optimizing for specific points along the ROC curve. While these approaches yield reasonable results in many cases, they do not actively seek to maximize performance for the desired operating point. In this study, we introduce a novel technique known as AUCReshaping, designed to reshape the ROC curve exclusively within the specified sensitivity and specificity range, by optimizing sensitivity at a predetermined specificity level. This reshaping is achieved through an adaptive and iterative boosting mechanism that allows the network to focus on pertinent samples during the learning process. We primarily investigated the impact of AUCReshaping in the context of abnormality detection tasks, specifically in Chest X-Ray (CXR) analysis, followed by breast mammogram and credit card fraud detection tasks. The results reveal a substantial improvement, ranging from 2 to 40%, in sensitivity at high-specificity levels for binary classification tasks.
format Online
Article
Text
id pubmed-10689839
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106898392023-12-02 AUCReshaping: improved sensitivity at high-specificity Bhat, Sheethal Mansoor, Awais Georgescu, Bogdan Panambur, Adarsh B. Ghesu, Florin C. Islam, Saahil Packhäuser, Kai Rodríguez-Salas, Dalia Grbic, Sasa Maier, Andreas Sci Rep Article The evaluation of deep-learning (DL) systems typically relies on the Area under the Receiver-Operating-Curve (AU-ROC) as a performance metric. However, AU-ROC, in its holistic form, does not sufficiently consider performance within specific ranges of sensitivity and specificity, which are critical for the intended operational context of the system. Consequently, two systems with identical AU-ROC values can exhibit significantly divergent real-world performance. This issue is particularly pronounced in the context of anomaly detection tasks, a commonly employed application of DL systems across various research domains, including medical imaging, industrial automation, manufacturing, cyber security, fraud detection, and drug research, among others. The challenge arises from the heavy class imbalance in training datasets, with the abnormality class often incurring a considerably higher misclassification cost compared to the normal class. Traditional DL systems address this by adjusting the weighting of the cost function or optimizing for specific points along the ROC curve. While these approaches yield reasonable results in many cases, they do not actively seek to maximize performance for the desired operating point. In this study, we introduce a novel technique known as AUCReshaping, designed to reshape the ROC curve exclusively within the specified sensitivity and specificity range, by optimizing sensitivity at a predetermined specificity level. This reshaping is achieved through an adaptive and iterative boosting mechanism that allows the network to focus on pertinent samples during the learning process. We primarily investigated the impact of AUCReshaping in the context of abnormality detection tasks, specifically in Chest X-Ray (CXR) analysis, followed by breast mammogram and credit card fraud detection tasks. The results reveal a substantial improvement, ranging from 2 to 40%, in sensitivity at high-specificity levels for binary classification tasks. Nature Publishing Group UK 2023-11-30 /pmc/articles/PMC10689839/ /pubmed/38036602 http://dx.doi.org/10.1038/s41598-023-48482-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Bhat, Sheethal
Mansoor, Awais
Georgescu, Bogdan
Panambur, Adarsh B.
Ghesu, Florin C.
Islam, Saahil
Packhäuser, Kai
Rodríguez-Salas, Dalia
Grbic, Sasa
Maier, Andreas
AUCReshaping: improved sensitivity at high-specificity
title AUCReshaping: improved sensitivity at high-specificity
title_full AUCReshaping: improved sensitivity at high-specificity
title_fullStr AUCReshaping: improved sensitivity at high-specificity
title_full_unstemmed AUCReshaping: improved sensitivity at high-specificity
title_short AUCReshaping: improved sensitivity at high-specificity
title_sort aucreshaping: improved sensitivity at high-specificity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10689839/
https://www.ncbi.nlm.nih.gov/pubmed/38036602
http://dx.doi.org/10.1038/s41598-023-48482-x
work_keys_str_mv AT bhatsheethal aucreshapingimprovedsensitivityathighspecificity
AT mansoorawais aucreshapingimprovedsensitivityathighspecificity
AT georgescubogdan aucreshapingimprovedsensitivityathighspecificity
AT panamburadarshb aucreshapingimprovedsensitivityathighspecificity
AT ghesuflorinc aucreshapingimprovedsensitivityathighspecificity
AT islamsaahil aucreshapingimprovedsensitivityathighspecificity
AT packhauserkai aucreshapingimprovedsensitivityathighspecificity
AT rodriguezsalasdalia aucreshapingimprovedsensitivityathighspecificity
AT grbicsasa aucreshapingimprovedsensitivityathighspecificity
AT maierandreas aucreshapingimprovedsensitivityathighspecificity