Cargando…
MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing
OBJECTIVES: Accurate histological typing plays an important role in diagnosing thymoma or thymic carcinoma (TC) and predicting the corresponding prognosis. In this paper, we develop and validate a deep learning-based thymoma typing method for hematoxylin & eosin (H&E)-stained whole slide ima...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9659861/ https://www.ncbi.nlm.nih.gov/pubmed/36387248 http://dx.doi.org/10.3389/fonc.2022.925903 |
_version_ | 1784830293441511424 |
---|---|
author | Zhang, Huaqi Chen, Huang Qin, Jin Wang, Bei Ma, Guolin Wang, Pengyu Zhong, Dingrong Liu, Jie |
author_facet | Zhang, Huaqi Chen, Huang Qin, Jin Wang, Bei Ma, Guolin Wang, Pengyu Zhong, Dingrong Liu, Jie |
author_sort | Zhang, Huaqi |
collection | PubMed |
description | OBJECTIVES: Accurate histological typing plays an important role in diagnosing thymoma or thymic carcinoma (TC) and predicting the corresponding prognosis. In this paper, we develop and validate a deep learning-based thymoma typing method for hematoxylin & eosin (H&E)-stained whole slide images (WSIs), which provides useful histopathology information from patients to assist doctors for better diagnosing thymoma or TC. METHODS: We propose a multi-path cross-scale vision transformer (MC-ViT), which first uses the cross attentive scale-aware transformer (CAST) to classify the pathological information related to thymoma, and then uses such pathological information priors to assist the WSIs transformer (WT) for thymoma typing. To make full use of the multi-scale (10×, 20×, and 40×) information inherent in a WSI, CAST not only employs parallel multi-path to capture different receptive field features from multi-scale WSI inputs, but also introduces the cross-correlation attention module (CAM) to aggregate multi-scale features to achieve cross-scale spatial information complementarity. After that, WT can effectively convert full-scale WSIs into 1D feature matrices with pathological information labels to improve the efficiency and accuracy of thymoma typing. RESULTS: We construct a large-scale thymoma histopathology WSI (THW) dataset and annotate corresponding pathological information and thymoma typing labels. The proposed MC-ViT achieves the Top-1 accuracy of 0.939 and 0.951 in pathological information classification and thymoma typing, respectively. Moreover, the quantitative and statistical experiments on the THW dataset also demonstrate that our pipeline performs favorably against the existing classical convolutional neural networks, vision transformers, and deep learning-based medical image classification methods. CONCLUSION: This paper demonstrates that comprehensively utilizing the pathological information contained in multi-scale WSIs is feasible for thymoma typing and achieves clinically acceptable performance. Specifically, the proposed MC-ViT can well predict pathological information classes as well as thymoma types, which show the application potential to the diagnosis of thymoma and TC and may assist doctors in improving diagnosis efficiency and accuracy. |
format | Online Article Text |
id | pubmed-9659861 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-96598612022-11-15 MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing Zhang, Huaqi Chen, Huang Qin, Jin Wang, Bei Ma, Guolin Wang, Pengyu Zhong, Dingrong Liu, Jie Front Oncol Oncology OBJECTIVES: Accurate histological typing plays an important role in diagnosing thymoma or thymic carcinoma (TC) and predicting the corresponding prognosis. In this paper, we develop and validate a deep learning-based thymoma typing method for hematoxylin & eosin (H&E)-stained whole slide images (WSIs), which provides useful histopathology information from patients to assist doctors for better diagnosing thymoma or TC. METHODS: We propose a multi-path cross-scale vision transformer (MC-ViT), which first uses the cross attentive scale-aware transformer (CAST) to classify the pathological information related to thymoma, and then uses such pathological information priors to assist the WSIs transformer (WT) for thymoma typing. To make full use of the multi-scale (10×, 20×, and 40×) information inherent in a WSI, CAST not only employs parallel multi-path to capture different receptive field features from multi-scale WSI inputs, but also introduces the cross-correlation attention module (CAM) to aggregate multi-scale features to achieve cross-scale spatial information complementarity. After that, WT can effectively convert full-scale WSIs into 1D feature matrices with pathological information labels to improve the efficiency and accuracy of thymoma typing. RESULTS: We construct a large-scale thymoma histopathology WSI (THW) dataset and annotate corresponding pathological information and thymoma typing labels. The proposed MC-ViT achieves the Top-1 accuracy of 0.939 and 0.951 in pathological information classification and thymoma typing, respectively. Moreover, the quantitative and statistical experiments on the THW dataset also demonstrate that our pipeline performs favorably against the existing classical convolutional neural networks, vision transformers, and deep learning-based medical image classification methods. CONCLUSION: This paper demonstrates that comprehensively utilizing the pathological information contained in multi-scale WSIs is feasible for thymoma typing and achieves clinically acceptable performance. Specifically, the proposed MC-ViT can well predict pathological information classes as well as thymoma types, which show the application potential to the diagnosis of thymoma and TC and may assist doctors in improving diagnosis efficiency and accuracy. Frontiers Media S.A. 2022-10-31 /pmc/articles/PMC9659861/ /pubmed/36387248 http://dx.doi.org/10.3389/fonc.2022.925903 Text en Copyright © 2022 Zhang, Chen, Qin, Wang, Ma, Wang, Zhong and Liu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Zhang, Huaqi Chen, Huang Qin, Jin Wang, Bei Ma, Guolin Wang, Pengyu Zhong, Dingrong Liu, Jie MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing |
title | MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing |
title_full | MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing |
title_fullStr | MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing |
title_full_unstemmed | MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing |
title_short | MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing |
title_sort | mc-vit: multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9659861/ https://www.ncbi.nlm.nih.gov/pubmed/36387248 http://dx.doi.org/10.3389/fonc.2022.925903 |
work_keys_str_mv | AT zhanghuaqi mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping AT chenhuang mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping AT qinjin mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping AT wangbei mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping AT maguolin mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping AT wangpengyu mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping AT zhongdingrong mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping AT liujie mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping |