Cargando…

Multi-site cross-organ calibrated deep learning (MuSClD): Automated diagnosis of non-melanoma skin cancer()

Although deep learning (DL) has demonstrated impressive diagnostic performance for a variety of computational pathology tasks, this performance often markedly deteriorates on whole slide images (WSI) generated at external test sites. This phenomenon is due in part to domain shift, wherein difference...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yufei, Koyuncu, Can, Lu, Cheng, Grobholz, Rainer, Katz, Ian, Madabhushi, Anant, Janowczyk, Andrew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825103/
https://www.ncbi.nlm.nih.gov/pubmed/36516556
http://dx.doi.org/10.1016/j.media.2022.102702
_version_ 1784866566079250432
author Zhou, Yufei
Koyuncu, Can
Lu, Cheng
Grobholz, Rainer
Katz, Ian
Madabhushi, Anant
Janowczyk, Andrew
author_facet Zhou, Yufei
Koyuncu, Can
Lu, Cheng
Grobholz, Rainer
Katz, Ian
Madabhushi, Anant
Janowczyk, Andrew
author_sort Zhou, Yufei
collection PubMed
description Although deep learning (DL) has demonstrated impressive diagnostic performance for a variety of computational pathology tasks, this performance often markedly deteriorates on whole slide images (WSI) generated at external test sites. This phenomenon is due in part to domain shift, wherein differences in test-site pre-analytical variables (e.g., slide scanner, staining procedure) result in WSI with notably different visual presentations compared to training data. To ameliorate pre-analytic variances, approaches such as CycleGAN can be used to calibrate visual properties of images between sites, with the intent of improving DL classifier generalizability. In this work, we present a new approach termed Multi-Site Cross-Organ Calibration based Deep Learning (MuSClD) that employs WSIs of an off-target organ for calibration created at the same site as the on-target organ, based off the assumption that cross-organ slides are subjected to a common set of pre-analytical sources of variance. We demonstrate that by using an off-target organ from the test site to calibrate training data, the domain shift between training and testing data can be mitigated. Importantly, this strategy uniquely guards against potential data leakage introduced during calibration, wherein information only available in the testing data is imparted on the training data. We evaluate MuSClD in the context of the automated diagnosis of non-melanoma skin cancer (NMSC). Specifically, we evaluated MuSClD for identifying and distinguishing (a) basal cell carcinoma (BCC), (b) in-situ squamous cell carcinomas (SCC-In Situ), and (c) invasive squamous cell carcinomas (SCC-Invasive), using an Australian (training, n = 85) and a Swiss (held-out testing, n = 352) cohort. Our experiments reveal that MuSCID reduces the Wasserstein distances between sites in terms of color, contrast, and brightness metrics, without imparting noticeable artifacts to training data. The NMSC-subtyping performance is statistically improved as a result of MuSCID in terms of one-vs. rest AUC: BCC (0.92 vs 0.87, p = 0.01), SCC-In Situ (0.87 vs 0.73, p = 0.15) and SCC-Invasive (0.92 vs 0.82, p = 1e-5). Compared to baseline NMSC-subtyping with no calibration, the internal validation results of MuSClD (BCC (0.98), SCC-In Situ (0.92), and SCC-Invasive (0.97)) suggest that while domain shift indeed degrades classification performance, our on-target calibration using off-target tissue can safely compensate for pre-analytical variabilities, while improving the robustness of the model.
format Online
Article
Text
id pubmed-9825103
institution National Center for Biotechnology Information
language English
publishDate 2023
record_format MEDLINE/PubMed
spelling pubmed-98251032023-02-01 Multi-site cross-organ calibrated deep learning (MuSClD): Automated diagnosis of non-melanoma skin cancer() Zhou, Yufei Koyuncu, Can Lu, Cheng Grobholz, Rainer Katz, Ian Madabhushi, Anant Janowczyk, Andrew Med Image Anal Article Although deep learning (DL) has demonstrated impressive diagnostic performance for a variety of computational pathology tasks, this performance often markedly deteriorates on whole slide images (WSI) generated at external test sites. This phenomenon is due in part to domain shift, wherein differences in test-site pre-analytical variables (e.g., slide scanner, staining procedure) result in WSI with notably different visual presentations compared to training data. To ameliorate pre-analytic variances, approaches such as CycleGAN can be used to calibrate visual properties of images between sites, with the intent of improving DL classifier generalizability. In this work, we present a new approach termed Multi-Site Cross-Organ Calibration based Deep Learning (MuSClD) that employs WSIs of an off-target organ for calibration created at the same site as the on-target organ, based off the assumption that cross-organ slides are subjected to a common set of pre-analytical sources of variance. We demonstrate that by using an off-target organ from the test site to calibrate training data, the domain shift between training and testing data can be mitigated. Importantly, this strategy uniquely guards against potential data leakage introduced during calibration, wherein information only available in the testing data is imparted on the training data. We evaluate MuSClD in the context of the automated diagnosis of non-melanoma skin cancer (NMSC). Specifically, we evaluated MuSClD for identifying and distinguishing (a) basal cell carcinoma (BCC), (b) in-situ squamous cell carcinomas (SCC-In Situ), and (c) invasive squamous cell carcinomas (SCC-Invasive), using an Australian (training, n = 85) and a Swiss (held-out testing, n = 352) cohort. Our experiments reveal that MuSCID reduces the Wasserstein distances between sites in terms of color, contrast, and brightness metrics, without imparting noticeable artifacts to training data. The NMSC-subtyping performance is statistically improved as a result of MuSCID in terms of one-vs. rest AUC: BCC (0.92 vs 0.87, p = 0.01), SCC-In Situ (0.87 vs 0.73, p = 0.15) and SCC-Invasive (0.92 vs 0.82, p = 1e-5). Compared to baseline NMSC-subtyping with no calibration, the internal validation results of MuSClD (BCC (0.98), SCC-In Situ (0.92), and SCC-Invasive (0.97)) suggest that while domain shift indeed degrades classification performance, our on-target calibration using off-target tissue can safely compensate for pre-analytical variabilities, while improving the robustness of the model. 2023-02 2022-11-24 /pmc/articles/PMC9825103/ /pubmed/36516556 http://dx.doi.org/10.1016/j.media.2022.102702 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ).
spellingShingle Article
Zhou, Yufei
Koyuncu, Can
Lu, Cheng
Grobholz, Rainer
Katz, Ian
Madabhushi, Anant
Janowczyk, Andrew
Multi-site cross-organ calibrated deep learning (MuSClD): Automated diagnosis of non-melanoma skin cancer()
title Multi-site cross-organ calibrated deep learning (MuSClD): Automated diagnosis of non-melanoma skin cancer()
title_full Multi-site cross-organ calibrated deep learning (MuSClD): Automated diagnosis of non-melanoma skin cancer()
title_fullStr Multi-site cross-organ calibrated deep learning (MuSClD): Automated diagnosis of non-melanoma skin cancer()
title_full_unstemmed Multi-site cross-organ calibrated deep learning (MuSClD): Automated diagnosis of non-melanoma skin cancer()
title_short Multi-site cross-organ calibrated deep learning (MuSClD): Automated diagnosis of non-melanoma skin cancer()
title_sort multi-site cross-organ calibrated deep learning (muscld): automated diagnosis of non-melanoma skin cancer()
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825103/
https://www.ncbi.nlm.nih.gov/pubmed/36516556
http://dx.doi.org/10.1016/j.media.2022.102702
work_keys_str_mv AT zhouyufei multisitecrossorgancalibrateddeeplearningmuscldautomateddiagnosisofnonmelanomaskincancer
AT koyuncucan multisitecrossorgancalibrateddeeplearningmuscldautomateddiagnosisofnonmelanomaskincancer
AT lucheng multisitecrossorgancalibrateddeeplearningmuscldautomateddiagnosisofnonmelanomaskincancer
AT grobholzrainer multisitecrossorgancalibrateddeeplearningmuscldautomateddiagnosisofnonmelanomaskincancer
AT katzian multisitecrossorgancalibrateddeeplearningmuscldautomateddiagnosisofnonmelanomaskincancer
AT madabhushianant multisitecrossorgancalibrateddeeplearningmuscldautomateddiagnosisofnonmelanomaskincancer
AT janowczykandrew multisitecrossorgancalibrateddeeplearningmuscldautomateddiagnosisofnonmelanomaskincancer