Cargando…

Predicting microsatellite instability and key biomarkers in colorectal cancer from H&E‐stained images: achieving state‐of‐the‐art predictive performance with fewer data using Swin Transformer

Many artificial intelligence models have been developed to predict clinically relevant biomarkers for colorectal cancer (CRC), including microsatellite instability (MSI). However, existing deep learning networks require large training datasets, which are often hard to obtain. In this study, based on...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Bangwei, Li, Xingyu, Yang, Miaomiao, Jonnagaddala, Jitendra, Zhang, Hong, Xu, Xu Steven
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10073932/
https://www.ncbi.nlm.nih.gov/pubmed/36723384
http://dx.doi.org/10.1002/cjp2.312
_version_ 1785019672074125312
author Guo, Bangwei
Li, Xingyu
Yang, Miaomiao
Jonnagaddala, Jitendra
Zhang, Hong
Xu, Xu Steven
author_facet Guo, Bangwei
Li, Xingyu
Yang, Miaomiao
Jonnagaddala, Jitendra
Zhang, Hong
Xu, Xu Steven
author_sort Guo, Bangwei
collection PubMed
description Many artificial intelligence models have been developed to predict clinically relevant biomarkers for colorectal cancer (CRC), including microsatellite instability (MSI). However, existing deep learning networks require large training datasets, which are often hard to obtain. In this study, based on the latest Hierarchical Vision Transformer using Shifted Windows (Swin Transformer [Swin‐T]), we developed an efficient workflow to predict biomarkers in CRC (MSI, hypermutation, chromosomal instability, CpG island methylator phenotype, and BRAF and TP53 mutation) that required relatively small datasets. Our Swin‐T workflow substantially achieved the state‐of‐the‐art (SOTA) predictive performance in an intra‐study cross‐validation experiment on the Cancer Genome Atlas colon and rectal cancer dataset (TCGA‐CRC‐DX). It also demonstrated excellent generalizability in cross‐study external validation and delivered a SOTA area under the receiver operating characteristic curve (AUROC) of 0.90 for MSI, using the Molecular and Cellular Oncology dataset for training (N = 1,065) and the TCGA‐CRC‐DX (N = 462) for testing. A similar performance (AUROC = 0.91) was reported in a recent study, using ~8,000 training samples (ResNet18) on the same testing dataset. Swin‐T was extremely efficient when using small training datasets and exhibited robust predictive performance with 200–500 training samples. Our findings indicate that Swin‐T could be 5–10 times more efficient than existing algorithms for MSI prediction based on ResNet18 and ShuffleNet. Furthermore, the Swin‐T models demonstrated their capability in accurately predicting MSI and BRAF mutation status, which could exclude and therefore reduce samples before subsequent standard testing in a cascading diagnostic workflow, in turn reducing turnaround time and costs.
format Online
Article
Text
id pubmed-10073932
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-100739322023-04-06 Predicting microsatellite instability and key biomarkers in colorectal cancer from H&E‐stained images: achieving state‐of‐the‐art predictive performance with fewer data using Swin Transformer Guo, Bangwei Li, Xingyu Yang, Miaomiao Jonnagaddala, Jitendra Zhang, Hong Xu, Xu Steven J Pathol Clin Res Original Articles Many artificial intelligence models have been developed to predict clinically relevant biomarkers for colorectal cancer (CRC), including microsatellite instability (MSI). However, existing deep learning networks require large training datasets, which are often hard to obtain. In this study, based on the latest Hierarchical Vision Transformer using Shifted Windows (Swin Transformer [Swin‐T]), we developed an efficient workflow to predict biomarkers in CRC (MSI, hypermutation, chromosomal instability, CpG island methylator phenotype, and BRAF and TP53 mutation) that required relatively small datasets. Our Swin‐T workflow substantially achieved the state‐of‐the‐art (SOTA) predictive performance in an intra‐study cross‐validation experiment on the Cancer Genome Atlas colon and rectal cancer dataset (TCGA‐CRC‐DX). It also demonstrated excellent generalizability in cross‐study external validation and delivered a SOTA area under the receiver operating characteristic curve (AUROC) of 0.90 for MSI, using the Molecular and Cellular Oncology dataset for training (N = 1,065) and the TCGA‐CRC‐DX (N = 462) for testing. A similar performance (AUROC = 0.91) was reported in a recent study, using ~8,000 training samples (ResNet18) on the same testing dataset. Swin‐T was extremely efficient when using small training datasets and exhibited robust predictive performance with 200–500 training samples. Our findings indicate that Swin‐T could be 5–10 times more efficient than existing algorithms for MSI prediction based on ResNet18 and ShuffleNet. Furthermore, the Swin‐T models demonstrated their capability in accurately predicting MSI and BRAF mutation status, which could exclude and therefore reduce samples before subsequent standard testing in a cascading diagnostic workflow, in turn reducing turnaround time and costs. John Wiley & Sons, Inc. 2023-02-01 /pmc/articles/PMC10073932/ /pubmed/36723384 http://dx.doi.org/10.1002/cjp2.312 Text en © 2023 The Authors. The Journal of Pathology: Clinical Research published by The Pathological Society of Great Britain and Ireland and John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Original Articles
Guo, Bangwei
Li, Xingyu
Yang, Miaomiao
Jonnagaddala, Jitendra
Zhang, Hong
Xu, Xu Steven
Predicting microsatellite instability and key biomarkers in colorectal cancer from H&E‐stained images: achieving state‐of‐the‐art predictive performance with fewer data using Swin Transformer
title Predicting microsatellite instability and key biomarkers in colorectal cancer from H&E‐stained images: achieving state‐of‐the‐art predictive performance with fewer data using Swin Transformer
title_full Predicting microsatellite instability and key biomarkers in colorectal cancer from H&E‐stained images: achieving state‐of‐the‐art predictive performance with fewer data using Swin Transformer
title_fullStr Predicting microsatellite instability and key biomarkers in colorectal cancer from H&E‐stained images: achieving state‐of‐the‐art predictive performance with fewer data using Swin Transformer
title_full_unstemmed Predicting microsatellite instability and key biomarkers in colorectal cancer from H&E‐stained images: achieving state‐of‐the‐art predictive performance with fewer data using Swin Transformer
title_short Predicting microsatellite instability and key biomarkers in colorectal cancer from H&E‐stained images: achieving state‐of‐the‐art predictive performance with fewer data using Swin Transformer
title_sort predicting microsatellite instability and key biomarkers in colorectal cancer from h&e‐stained images: achieving state‐of‐the‐art predictive performance with fewer data using swin transformer
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10073932/
https://www.ncbi.nlm.nih.gov/pubmed/36723384
http://dx.doi.org/10.1002/cjp2.312
work_keys_str_mv AT guobangwei predictingmicrosatelliteinstabilityandkeybiomarkersincolorectalcancerfromhestainedimagesachievingstateoftheartpredictiveperformancewithfewerdatausingswintransformer
AT lixingyu predictingmicrosatelliteinstabilityandkeybiomarkersincolorectalcancerfromhestainedimagesachievingstateoftheartpredictiveperformancewithfewerdatausingswintransformer
AT yangmiaomiao predictingmicrosatelliteinstabilityandkeybiomarkersincolorectalcancerfromhestainedimagesachievingstateoftheartpredictiveperformancewithfewerdatausingswintransformer
AT jonnagaddalajitendra predictingmicrosatelliteinstabilityandkeybiomarkersincolorectalcancerfromhestainedimagesachievingstateoftheartpredictiveperformancewithfewerdatausingswintransformer
AT zhanghong predictingmicrosatelliteinstabilityandkeybiomarkersincolorectalcancerfromhestainedimagesachievingstateoftheartpredictiveperformancewithfewerdatausingswintransformer
AT xuxusteven predictingmicrosatelliteinstabilityandkeybiomarkersincolorectalcancerfromhestainedimagesachievingstateoftheartpredictiveperformancewithfewerdatausingswintransformer