Cargando…

Data-Efficient Computational Pathology Platform for Faster and Cheaper Breast Cancer Subtype Identifications: Development of a Deep Learning Model

BACKGROUND: Breast cancer subtyping is a crucial step in determining therapeutic options, but the molecular examination based on immunohistochemical staining is expensive and time-consuming. Deep learning opens up the possibility to predict the subtypes based on the morphological information from he...

Descripción completa

Detalles Bibliográficos
Autores principales: Bae, Kideog, Jeon, Young Seok, Hwangbo, Yul, Yoo, Chong Woo, Han, Nayoung, Feng, Mengling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10509735/
https://www.ncbi.nlm.nih.gov/pubmed/37669090
http://dx.doi.org/10.2196/45547
_version_ 1785107808532824064
author Bae, Kideog
Jeon, Young Seok
Hwangbo, Yul
Yoo, Chong Woo
Han, Nayoung
Feng, Mengling
author_facet Bae, Kideog
Jeon, Young Seok
Hwangbo, Yul
Yoo, Chong Woo
Han, Nayoung
Feng, Mengling
author_sort Bae, Kideog
collection PubMed
description BACKGROUND: Breast cancer subtyping is a crucial step in determining therapeutic options, but the molecular examination based on immunohistochemical staining is expensive and time-consuming. Deep learning opens up the possibility to predict the subtypes based on the morphological information from hematoxylin and eosin staining, a much cheaper and faster alternative. However, training the predictive model conventionally requires a large number of histology images, which is challenging to collect by a single institute. OBJECTIVE: We aimed to develop a data-efficient computational pathology platform, 3DHistoNet, which is capable of learning from z-stacked histology images to accurately predict breast cancer subtypes with a small sample size. METHODS: We retrospectively examined 401 cases of patients with primary breast carcinoma diagnosed between 2018 and 2020 at the Department of Pathology, National Cancer Center, South Korea. Pathology slides of the patients with breast carcinoma were prepared according to the standard protocols. Age, gender, histologic grade, hormone receptor (estrogen receptor [ER], progesterone receptor [PR], and androgen receptor [AR]) status, erb-B2 receptor tyrosine kinase 2 (HER2) status, and Ki-67 index were evaluated by reviewing medical charts and pathological records. RESULTS: The area under the receiver operating characteristic curve and decision curve were analyzed to evaluate the performance of our 3DHistoNet platform for predicting the ER, PR, AR, HER2, and Ki67 subtype biomarkers with 5-fold cross-validation. We demonstrated that 3DHistoNet can predict all clinically important biomarkers (ER, PR, AR, HER2, and Ki67) with performance exceeding the conventional multiple instance learning models by a considerable margin (area under the receiver operating characteristic curve: 0.75-0.91 vs 0.67-0.8). We further showed that our z-stack histology scanning method can make up for insufficient training data sets without any additional cost incurred. Finally, 3DHistoNet offered an additional capability to generate attention maps that reveal correlations between Ki67 and histomorphological features, which renders the hematoxylin and eosin image in higher fidelity to the pathologist. CONCLUSIONS: Our stand-alone, data-efficient pathology platform that can both generate z-stacked images and predict key biomarkers is an appealing tool for breast cancer diagnosis. Its development would encourage morphology-based diagnosis, which is faster, cheaper, and less error-prone compared to the protein quantification method based on immunohistochemical staining.
format Online
Article
Text
id pubmed-10509735
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-105097352023-09-21 Data-Efficient Computational Pathology Platform for Faster and Cheaper Breast Cancer Subtype Identifications: Development of a Deep Learning Model Bae, Kideog Jeon, Young Seok Hwangbo, Yul Yoo, Chong Woo Han, Nayoung Feng, Mengling JMIR Cancer Original Paper BACKGROUND: Breast cancer subtyping is a crucial step in determining therapeutic options, but the molecular examination based on immunohistochemical staining is expensive and time-consuming. Deep learning opens up the possibility to predict the subtypes based on the morphological information from hematoxylin and eosin staining, a much cheaper and faster alternative. However, training the predictive model conventionally requires a large number of histology images, which is challenging to collect by a single institute. OBJECTIVE: We aimed to develop a data-efficient computational pathology platform, 3DHistoNet, which is capable of learning from z-stacked histology images to accurately predict breast cancer subtypes with a small sample size. METHODS: We retrospectively examined 401 cases of patients with primary breast carcinoma diagnosed between 2018 and 2020 at the Department of Pathology, National Cancer Center, South Korea. Pathology slides of the patients with breast carcinoma were prepared according to the standard protocols. Age, gender, histologic grade, hormone receptor (estrogen receptor [ER], progesterone receptor [PR], and androgen receptor [AR]) status, erb-B2 receptor tyrosine kinase 2 (HER2) status, and Ki-67 index were evaluated by reviewing medical charts and pathological records. RESULTS: The area under the receiver operating characteristic curve and decision curve were analyzed to evaluate the performance of our 3DHistoNet platform for predicting the ER, PR, AR, HER2, and Ki67 subtype biomarkers with 5-fold cross-validation. We demonstrated that 3DHistoNet can predict all clinically important biomarkers (ER, PR, AR, HER2, and Ki67) with performance exceeding the conventional multiple instance learning models by a considerable margin (area under the receiver operating characteristic curve: 0.75-0.91 vs 0.67-0.8). We further showed that our z-stack histology scanning method can make up for insufficient training data sets without any additional cost incurred. Finally, 3DHistoNet offered an additional capability to generate attention maps that reveal correlations between Ki67 and histomorphological features, which renders the hematoxylin and eosin image in higher fidelity to the pathologist. CONCLUSIONS: Our stand-alone, data-efficient pathology platform that can both generate z-stacked images and predict key biomarkers is an appealing tool for breast cancer diagnosis. Its development would encourage morphology-based diagnosis, which is faster, cheaper, and less error-prone compared to the protein quantification method based on immunohistochemical staining. JMIR Publications 2023-09-05 /pmc/articles/PMC10509735/ /pubmed/37669090 http://dx.doi.org/10.2196/45547 Text en ©Kideog Bae, Young Seok Jeon, Yul Hwangbo, Chong Woo Yoo, Nayoung Han, Mengling Feng. Originally published in JMIR Cancer (https://cancer.jmir.org), 05.09.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Cancer, is properly cited. The complete bibliographic information, a link to the original publication on https://cancer.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Bae, Kideog
Jeon, Young Seok
Hwangbo, Yul
Yoo, Chong Woo
Han, Nayoung
Feng, Mengling
Data-Efficient Computational Pathology Platform for Faster and Cheaper Breast Cancer Subtype Identifications: Development of a Deep Learning Model
title Data-Efficient Computational Pathology Platform for Faster and Cheaper Breast Cancer Subtype Identifications: Development of a Deep Learning Model
title_full Data-Efficient Computational Pathology Platform for Faster and Cheaper Breast Cancer Subtype Identifications: Development of a Deep Learning Model
title_fullStr Data-Efficient Computational Pathology Platform for Faster and Cheaper Breast Cancer Subtype Identifications: Development of a Deep Learning Model
title_full_unstemmed Data-Efficient Computational Pathology Platform for Faster and Cheaper Breast Cancer Subtype Identifications: Development of a Deep Learning Model
title_short Data-Efficient Computational Pathology Platform for Faster and Cheaper Breast Cancer Subtype Identifications: Development of a Deep Learning Model
title_sort data-efficient computational pathology platform for faster and cheaper breast cancer subtype identifications: development of a deep learning model
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10509735/
https://www.ncbi.nlm.nih.gov/pubmed/37669090
http://dx.doi.org/10.2196/45547
work_keys_str_mv AT baekideog dataefficientcomputationalpathologyplatformforfasterandcheaperbreastcancersubtypeidentificationsdevelopmentofadeeplearningmodel
AT jeonyoungseok dataefficientcomputationalpathologyplatformforfasterandcheaperbreastcancersubtypeidentificationsdevelopmentofadeeplearningmodel
AT hwangboyul dataefficientcomputationalpathologyplatformforfasterandcheaperbreastcancersubtypeidentificationsdevelopmentofadeeplearningmodel
AT yoochongwoo dataefficientcomputationalpathologyplatformforfasterandcheaperbreastcancersubtypeidentificationsdevelopmentofadeeplearningmodel
AT hannayoung dataefficientcomputationalpathologyplatformforfasterandcheaperbreastcancersubtypeidentificationsdevelopmentofadeeplearningmodel
AT fengmengling dataefficientcomputationalpathologyplatformforfasterandcheaperbreastcancersubtypeidentificationsdevelopmentofadeeplearningmodel