Cargando…

An interpretable semi-supervised framework for patch-based classification of breast cancer

Developing effective invasive Ductal Carcinoma (IDC) detection methods remains a challenging problem for breast cancer diagnosis. Recently, there has been notable success in utilizing deep neural networks in various application domains; however, it is well-known that deep neural networks require a l...

Descripción completa

Detalles Bibliográficos
Autores principales: Shawi, Radwa El, Kilanava, Khatia, Sakr, Sherif
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9537500/
https://www.ncbi.nlm.nih.gov/pubmed/36202832
http://dx.doi.org/10.1038/s41598-022-20268-7
_version_ 1784803214932049920
author Shawi, Radwa El
Kilanava, Khatia
Sakr, Sherif
author_facet Shawi, Radwa El
Kilanava, Khatia
Sakr, Sherif
author_sort Shawi, Radwa El
collection PubMed
description Developing effective invasive Ductal Carcinoma (IDC) detection methods remains a challenging problem for breast cancer diagnosis. Recently, there has been notable success in utilizing deep neural networks in various application domains; however, it is well-known that deep neural networks require a large amount of labelled training data to achieve high accuracy. Such amounts of manually labelled data are time-consuming and expensive, especially when domain expertise is required. To this end, we present a novel semi-supervised learning framework for IDC detection using small amounts of labelled training examples to take advantage of cheap available unlabeled data. To gain trust in the prediction of the framework, we explain the prediction globally. Our proposed framework consists of five main stages: data augmentation, feature selection, dividing co-training data labelling, deep neural network modelling, and the interpretability of neural network prediction. The data cohort used in this study contains digitized BCa histopathology slides from 162 women with IDC at the Hospital of the University of Pennsylvania and the Cancer Institute of New Jersey. To evaluate the effectiveness of the deep neural network model used by the proposed approach, we compare it to different state-of-the-art network architectures; AlexNet and a shallow VGG network trained only on the labelled data. The results show that the deep neural network used in our proposed approach outperforms the state-of-the-art techniques achieving balanced accuracy of 0.73 and F-measure of 0.843. In addition, we compare the performance of the proposed semi-supervised approach to state-of-the-art semi-supervised DCGAN technique and self-learning technique. The experimental evaluation shows that our framework outperforms both semi-supervised techniques and detects IDC with an accuracy of 85.75%, a balanced accuracy of 0.865, and an F-measure of 0.773 using only 10% labelled instances from the training dataset while the rest of the training dataset is treated as unlabeled.
format Online
Article
Text
id pubmed-9537500
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-95375002022-10-08 An interpretable semi-supervised framework for patch-based classification of breast cancer Shawi, Radwa El Kilanava, Khatia Sakr, Sherif Sci Rep Article Developing effective invasive Ductal Carcinoma (IDC) detection methods remains a challenging problem for breast cancer diagnosis. Recently, there has been notable success in utilizing deep neural networks in various application domains; however, it is well-known that deep neural networks require a large amount of labelled training data to achieve high accuracy. Such amounts of manually labelled data are time-consuming and expensive, especially when domain expertise is required. To this end, we present a novel semi-supervised learning framework for IDC detection using small amounts of labelled training examples to take advantage of cheap available unlabeled data. To gain trust in the prediction of the framework, we explain the prediction globally. Our proposed framework consists of five main stages: data augmentation, feature selection, dividing co-training data labelling, deep neural network modelling, and the interpretability of neural network prediction. The data cohort used in this study contains digitized BCa histopathology slides from 162 women with IDC at the Hospital of the University of Pennsylvania and the Cancer Institute of New Jersey. To evaluate the effectiveness of the deep neural network model used by the proposed approach, we compare it to different state-of-the-art network architectures; AlexNet and a shallow VGG network trained only on the labelled data. The results show that the deep neural network used in our proposed approach outperforms the state-of-the-art techniques achieving balanced accuracy of 0.73 and F-measure of 0.843. In addition, we compare the performance of the proposed semi-supervised approach to state-of-the-art semi-supervised DCGAN technique and self-learning technique. The experimental evaluation shows that our framework outperforms both semi-supervised techniques and detects IDC with an accuracy of 85.75%, a balanced accuracy of 0.865, and an F-measure of 0.773 using only 10% labelled instances from the training dataset while the rest of the training dataset is treated as unlabeled. Nature Publishing Group UK 2022-10-06 /pmc/articles/PMC9537500/ /pubmed/36202832 http://dx.doi.org/10.1038/s41598-022-20268-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Shawi, Radwa El
Kilanava, Khatia
Sakr, Sherif
An interpretable semi-supervised framework for patch-based classification of breast cancer
title An interpretable semi-supervised framework for patch-based classification of breast cancer
title_full An interpretable semi-supervised framework for patch-based classification of breast cancer
title_fullStr An interpretable semi-supervised framework for patch-based classification of breast cancer
title_full_unstemmed An interpretable semi-supervised framework for patch-based classification of breast cancer
title_short An interpretable semi-supervised framework for patch-based classification of breast cancer
title_sort interpretable semi-supervised framework for patch-based classification of breast cancer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9537500/
https://www.ncbi.nlm.nih.gov/pubmed/36202832
http://dx.doi.org/10.1038/s41598-022-20268-7
work_keys_str_mv AT shawiradwael aninterpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer
AT kilanavakhatia aninterpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer
AT sakrsherif aninterpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer
AT shawiradwael interpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer
AT kilanavakhatia interpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer
AT sakrsherif interpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer