Cargando…

Performance comparison between multi-center histopathology datasets of a weakly-supervised deep learning model for pancreatic ductal adenocarcinoma detection

BACKGROUND: Pancreatic ductal carcinoma patients have a really poor prognosis given its difficult early detection and the lack of early symptoms. Digital pathology is routinely used by pathologists to diagnose the disease. However, visually inspecting the tissue is a time-consuming task, which slows...

Descripción completa

Detalles Bibliográficos
Autores principales: Carrillo-Perez, Francisco, Ortuno, Francisco M., Börjesson, Alejandro, Rojas, Ignacio, Herrera, Luis Javier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10294485/
https://www.ncbi.nlm.nih.gov/pubmed/37365659
http://dx.doi.org/10.1186/s40644-023-00586-3
_version_ 1785063208145387520
author Carrillo-Perez, Francisco
Ortuno, Francisco M.
Börjesson, Alejandro
Rojas, Ignacio
Herrera, Luis Javier
author_facet Carrillo-Perez, Francisco
Ortuno, Francisco M.
Börjesson, Alejandro
Rojas, Ignacio
Herrera, Luis Javier
author_sort Carrillo-Perez, Francisco
collection PubMed
description BACKGROUND: Pancreatic ductal carcinoma patients have a really poor prognosis given its difficult early detection and the lack of early symptoms. Digital pathology is routinely used by pathologists to diagnose the disease. However, visually inspecting the tissue is a time-consuming task, which slows down the diagnostic procedure. With the advances occurred in the area of artificial intelligence, specifically with deep learning models, and the growing availability of public histology data, clinical decision support systems are being created. However, the generalization capabilities of these systems are not always tested, nor the integration of publicly available datasets for pancreatic ductal carcinoma detection (PDAC). METHODS: In this work, we explored the performace of two weakly-supervised deep learning models using the two more widely available datasets with pancreatic ductal carcinoma histology images, The Cancer Genome Atlas Project (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). In order to have sufficient training data, the TCGA dataset was integrated with the Genotype-Tissue Expression (GTEx) project dataset, which contains healthy pancreatic samples. RESULTS: We showed how the model trained on CPTAC generalizes better than the one trained on the integrated dataset, obtaining an inter-dataset accuracy of 90.62% ± 2.32 and an outer-dataset accuracy of 92.17% when evaluated on TCGA + GTEx. Furthermore, we tested the performance on another dataset formed by tissue micro-arrays, obtaining an accuracy of 98.59%. We showed how the features learned in an integrated dataset do not differentiate between the classes, but between the datasets, noticing that a stronger normalization might be needed when creating clinical decision support systems with datasets obtained from different sources. To mitigate this effect, we proposed to train on the three available datasets, improving the detection performance and generalization capabilities of a model trained only on TCGA + GTEx and achieving a similar performance to the model trained only on CPTAC. CONCLUSIONS: The integration of datasets where both classes are present can mitigate the batch effect present when integrating datasets, improving the classification performance, and accurately detecting PDAC across different datasets. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40644-023-00586-3.
format Online
Article
Text
id pubmed-10294485
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102944852023-06-28 Performance comparison between multi-center histopathology datasets of a weakly-supervised deep learning model for pancreatic ductal adenocarcinoma detection Carrillo-Perez, Francisco Ortuno, Francisco M. Börjesson, Alejandro Rojas, Ignacio Herrera, Luis Javier Cancer Imaging Research Article BACKGROUND: Pancreatic ductal carcinoma patients have a really poor prognosis given its difficult early detection and the lack of early symptoms. Digital pathology is routinely used by pathologists to diagnose the disease. However, visually inspecting the tissue is a time-consuming task, which slows down the diagnostic procedure. With the advances occurred in the area of artificial intelligence, specifically with deep learning models, and the growing availability of public histology data, clinical decision support systems are being created. However, the generalization capabilities of these systems are not always tested, nor the integration of publicly available datasets for pancreatic ductal carcinoma detection (PDAC). METHODS: In this work, we explored the performace of two weakly-supervised deep learning models using the two more widely available datasets with pancreatic ductal carcinoma histology images, The Cancer Genome Atlas Project (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). In order to have sufficient training data, the TCGA dataset was integrated with the Genotype-Tissue Expression (GTEx) project dataset, which contains healthy pancreatic samples. RESULTS: We showed how the model trained on CPTAC generalizes better than the one trained on the integrated dataset, obtaining an inter-dataset accuracy of 90.62% ± 2.32 and an outer-dataset accuracy of 92.17% when evaluated on TCGA + GTEx. Furthermore, we tested the performance on another dataset formed by tissue micro-arrays, obtaining an accuracy of 98.59%. We showed how the features learned in an integrated dataset do not differentiate between the classes, but between the datasets, noticing that a stronger normalization might be needed when creating clinical decision support systems with datasets obtained from different sources. To mitigate this effect, we proposed to train on the three available datasets, improving the detection performance and generalization capabilities of a model trained only on TCGA + GTEx and achieving a similar performance to the model trained only on CPTAC. CONCLUSIONS: The integration of datasets where both classes are present can mitigate the batch effect present when integrating datasets, improving the classification performance, and accurately detecting PDAC across different datasets. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40644-023-00586-3. BioMed Central 2023-06-26 /pmc/articles/PMC10294485/ /pubmed/37365659 http://dx.doi.org/10.1186/s40644-023-00586-3 Text en © The Author(s) 2023, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Carrillo-Perez, Francisco
Ortuno, Francisco M.
Börjesson, Alejandro
Rojas, Ignacio
Herrera, Luis Javier
Performance comparison between multi-center histopathology datasets of a weakly-supervised deep learning model for pancreatic ductal adenocarcinoma detection
title Performance comparison between multi-center histopathology datasets of a weakly-supervised deep learning model for pancreatic ductal adenocarcinoma detection
title_full Performance comparison between multi-center histopathology datasets of a weakly-supervised deep learning model for pancreatic ductal adenocarcinoma detection
title_fullStr Performance comparison between multi-center histopathology datasets of a weakly-supervised deep learning model for pancreatic ductal adenocarcinoma detection
title_full_unstemmed Performance comparison between multi-center histopathology datasets of a weakly-supervised deep learning model for pancreatic ductal adenocarcinoma detection
title_short Performance comparison between multi-center histopathology datasets of a weakly-supervised deep learning model for pancreatic ductal adenocarcinoma detection
title_sort performance comparison between multi-center histopathology datasets of a weakly-supervised deep learning model for pancreatic ductal adenocarcinoma detection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10294485/
https://www.ncbi.nlm.nih.gov/pubmed/37365659
http://dx.doi.org/10.1186/s40644-023-00586-3
work_keys_str_mv AT carrilloperezfrancisco performancecomparisonbetweenmulticenterhistopathologydatasetsofaweaklysuperviseddeeplearningmodelforpancreaticductaladenocarcinomadetection
AT ortunofranciscom performancecomparisonbetweenmulticenterhistopathologydatasetsofaweaklysuperviseddeeplearningmodelforpancreaticductaladenocarcinomadetection
AT borjessonalejandro performancecomparisonbetweenmulticenterhistopathologydatasetsofaweaklysuperviseddeeplearningmodelforpancreaticductaladenocarcinomadetection
AT rojasignacio performancecomparisonbetweenmulticenterhistopathologydatasetsofaweaklysuperviseddeeplearningmodelforpancreaticductaladenocarcinomadetection
AT herreraluisjavier performancecomparisonbetweenmulticenterhistopathologydatasetsofaweaklysuperviseddeeplearningmodelforpancreaticductaladenocarcinomadetection