Cargando…
An interpretable semi-supervised framework for patch-based classification of breast cancer
Developing effective invasive Ductal Carcinoma (IDC) detection methods remains a challenging problem for breast cancer diagnosis. Recently, there has been notable success in utilizing deep neural networks in various application domains; however, it is well-known that deep neural networks require a l...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9537500/ https://www.ncbi.nlm.nih.gov/pubmed/36202832 http://dx.doi.org/10.1038/s41598-022-20268-7 |
_version_ | 1784803214932049920 |
---|---|
author | Shawi, Radwa El Kilanava, Khatia Sakr, Sherif |
author_facet | Shawi, Radwa El Kilanava, Khatia Sakr, Sherif |
author_sort | Shawi, Radwa El |
collection | PubMed |
description | Developing effective invasive Ductal Carcinoma (IDC) detection methods remains a challenging problem for breast cancer diagnosis. Recently, there has been notable success in utilizing deep neural networks in various application domains; however, it is well-known that deep neural networks require a large amount of labelled training data to achieve high accuracy. Such amounts of manually labelled data are time-consuming and expensive, especially when domain expertise is required. To this end, we present a novel semi-supervised learning framework for IDC detection using small amounts of labelled training examples to take advantage of cheap available unlabeled data. To gain trust in the prediction of the framework, we explain the prediction globally. Our proposed framework consists of five main stages: data augmentation, feature selection, dividing co-training data labelling, deep neural network modelling, and the interpretability of neural network prediction. The data cohort used in this study contains digitized BCa histopathology slides from 162 women with IDC at the Hospital of the University of Pennsylvania and the Cancer Institute of New Jersey. To evaluate the effectiveness of the deep neural network model used by the proposed approach, we compare it to different state-of-the-art network architectures; AlexNet and a shallow VGG network trained only on the labelled data. The results show that the deep neural network used in our proposed approach outperforms the state-of-the-art techniques achieving balanced accuracy of 0.73 and F-measure of 0.843. In addition, we compare the performance of the proposed semi-supervised approach to state-of-the-art semi-supervised DCGAN technique and self-learning technique. The experimental evaluation shows that our framework outperforms both semi-supervised techniques and detects IDC with an accuracy of 85.75%, a balanced accuracy of 0.865, and an F-measure of 0.773 using only 10% labelled instances from the training dataset while the rest of the training dataset is treated as unlabeled. |
format | Online Article Text |
id | pubmed-9537500 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-95375002022-10-08 An interpretable semi-supervised framework for patch-based classification of breast cancer Shawi, Radwa El Kilanava, Khatia Sakr, Sherif Sci Rep Article Developing effective invasive Ductal Carcinoma (IDC) detection methods remains a challenging problem for breast cancer diagnosis. Recently, there has been notable success in utilizing deep neural networks in various application domains; however, it is well-known that deep neural networks require a large amount of labelled training data to achieve high accuracy. Such amounts of manually labelled data are time-consuming and expensive, especially when domain expertise is required. To this end, we present a novel semi-supervised learning framework for IDC detection using small amounts of labelled training examples to take advantage of cheap available unlabeled data. To gain trust in the prediction of the framework, we explain the prediction globally. Our proposed framework consists of five main stages: data augmentation, feature selection, dividing co-training data labelling, deep neural network modelling, and the interpretability of neural network prediction. The data cohort used in this study contains digitized BCa histopathology slides from 162 women with IDC at the Hospital of the University of Pennsylvania and the Cancer Institute of New Jersey. To evaluate the effectiveness of the deep neural network model used by the proposed approach, we compare it to different state-of-the-art network architectures; AlexNet and a shallow VGG network trained only on the labelled data. The results show that the deep neural network used in our proposed approach outperforms the state-of-the-art techniques achieving balanced accuracy of 0.73 and F-measure of 0.843. In addition, we compare the performance of the proposed semi-supervised approach to state-of-the-art semi-supervised DCGAN technique and self-learning technique. The experimental evaluation shows that our framework outperforms both semi-supervised techniques and detects IDC with an accuracy of 85.75%, a balanced accuracy of 0.865, and an F-measure of 0.773 using only 10% labelled instances from the training dataset while the rest of the training dataset is treated as unlabeled. Nature Publishing Group UK 2022-10-06 /pmc/articles/PMC9537500/ /pubmed/36202832 http://dx.doi.org/10.1038/s41598-022-20268-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Shawi, Radwa El Kilanava, Khatia Sakr, Sherif An interpretable semi-supervised framework for patch-based classification of breast cancer |
title | An interpretable semi-supervised framework for patch-based classification of breast cancer |
title_full | An interpretable semi-supervised framework for patch-based classification of breast cancer |
title_fullStr | An interpretable semi-supervised framework for patch-based classification of breast cancer |
title_full_unstemmed | An interpretable semi-supervised framework for patch-based classification of breast cancer |
title_short | An interpretable semi-supervised framework for patch-based classification of breast cancer |
title_sort | interpretable semi-supervised framework for patch-based classification of breast cancer |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9537500/ https://www.ncbi.nlm.nih.gov/pubmed/36202832 http://dx.doi.org/10.1038/s41598-022-20268-7 |
work_keys_str_mv | AT shawiradwael aninterpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer AT kilanavakhatia aninterpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer AT sakrsherif aninterpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer AT shawiradwael interpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer AT kilanavakhatia interpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer AT sakrsherif interpretablesemisupervisedframeworkforpatchbasedclassificationofbreastcancer |