Cargando…

Identification of misdiagnosis by deep neural networks on a histopathologic review of breast cancer lymph node metastases

The frozen section (FS) diagnoses of pathology experts are used in China to determine whether sentinel lymph nodes of breast cancer have metastasis during operation. Direct implementation of a deep neural network (DNN) in clinical practice may be hindered by misdiagnosis of the algorithm, which affe...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Cancan, Zheng, Shan, Guo, Lei, Yang, Xuebing, Song, Yan, Li, Zhuo, Zhu, Yanwu, Liu, Xiaoqi, Li, Qingzhuang, Zhang, Huijuan, Feng, Ning, Zhao, Zuxuan, Qiu, Tinglin, Du, Jun, Guo, Qiang, Zhang, Wensheng, Shi, Wenzhao, Ma, Jianhui, Sun, Fenglong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9355979/
https://www.ncbi.nlm.nih.gov/pubmed/35931718
http://dx.doi.org/10.1038/s41598-022-17606-0
Descripción
Sumario:The frozen section (FS) diagnoses of pathology experts are used in China to determine whether sentinel lymph nodes of breast cancer have metastasis during operation. Direct implementation of a deep neural network (DNN) in clinical practice may be hindered by misdiagnosis of the algorithm, which affects a patient's treatment decision. In this study, we first obtained the prediction result of the commonly used patch-DNN, then we present a relative risk classification and regression tree (RRCART) to identify the misdiagnosed whole-slide images (WSIs) and recommend them to be reviewed by pathologists. Applying this framework to 2362 WSIs of breast cancer lymph node metastasis, test on frozen section results in the mean area under the curve (AUC) reached 0.9851. However, the mean misdiagnosis rate (0.0248), was significantly higher than the pathologists’ misdiagnosis rate (p < 0.01). The RRCART distinguished more than 80% of the WSIs as a high-accuracy group with an average accuracy reached to 0.995, but the difference with the pathologists’ performance was not significant (p > 0.01). However, the other low-accuracy group included most of the misdiagnoses of DNN models. Our research shows that the misdiagnosis from deep learning model can be further enriched by our method, and that the low-accuracy WSIs must be selected for pathologists to review and the high-accuracy ones may be ready for pathologists to give diagnostic reports.