Cargando…

MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing

OBJECTIVES: Accurate histological typing plays an important role in diagnosing thymoma or thymic carcinoma (TC) and predicting the corresponding prognosis. In this paper, we develop and validate a deep learning-based thymoma typing method for hematoxylin & eosin (H&E)-stained whole slide ima...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Huaqi, Chen, Huang, Qin, Jin, Wang, Bei, Ma, Guolin, Wang, Pengyu, Zhong, Dingrong, Liu, Jie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9659861/
https://www.ncbi.nlm.nih.gov/pubmed/36387248
http://dx.doi.org/10.3389/fonc.2022.925903
_version_ 1784830293441511424
author Zhang, Huaqi
Chen, Huang
Qin, Jin
Wang, Bei
Ma, Guolin
Wang, Pengyu
Zhong, Dingrong
Liu, Jie
author_facet Zhang, Huaqi
Chen, Huang
Qin, Jin
Wang, Bei
Ma, Guolin
Wang, Pengyu
Zhong, Dingrong
Liu, Jie
author_sort Zhang, Huaqi
collection PubMed
description OBJECTIVES: Accurate histological typing plays an important role in diagnosing thymoma or thymic carcinoma (TC) and predicting the corresponding prognosis. In this paper, we develop and validate a deep learning-based thymoma typing method for hematoxylin & eosin (H&E)-stained whole slide images (WSIs), which provides useful histopathology information from patients to assist doctors for better diagnosing thymoma or TC. METHODS: We propose a multi-path cross-scale vision transformer (MC-ViT), which first uses the cross attentive scale-aware transformer (CAST) to classify the pathological information related to thymoma, and then uses such pathological information priors to assist the WSIs transformer (WT) for thymoma typing. To make full use of the multi-scale (10×, 20×, and 40×) information inherent in a WSI, CAST not only employs parallel multi-path to capture different receptive field features from multi-scale WSI inputs, but also introduces the cross-correlation attention module (CAM) to aggregate multi-scale features to achieve cross-scale spatial information complementarity. After that, WT can effectively convert full-scale WSIs into 1D feature matrices with pathological information labels to improve the efficiency and accuracy of thymoma typing. RESULTS: We construct a large-scale thymoma histopathology WSI (THW) dataset and annotate corresponding pathological information and thymoma typing labels. The proposed MC-ViT achieves the Top-1 accuracy of 0.939 and 0.951 in pathological information classification and thymoma typing, respectively. Moreover, the quantitative and statistical experiments on the THW dataset also demonstrate that our pipeline performs favorably against the existing classical convolutional neural networks, vision transformers, and deep learning-based medical image classification methods. CONCLUSION: This paper demonstrates that comprehensively utilizing the pathological information contained in multi-scale WSIs is feasible for thymoma typing and achieves clinically acceptable performance. Specifically, the proposed MC-ViT can well predict pathological information classes as well as thymoma types, which show the application potential to the diagnosis of thymoma and TC and may assist doctors in improving diagnosis efficiency and accuracy.
format Online
Article
Text
id pubmed-9659861
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96598612022-11-15 MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing Zhang, Huaqi Chen, Huang Qin, Jin Wang, Bei Ma, Guolin Wang, Pengyu Zhong, Dingrong Liu, Jie Front Oncol Oncology OBJECTIVES: Accurate histological typing plays an important role in diagnosing thymoma or thymic carcinoma (TC) and predicting the corresponding prognosis. In this paper, we develop and validate a deep learning-based thymoma typing method for hematoxylin & eosin (H&E)-stained whole slide images (WSIs), which provides useful histopathology information from patients to assist doctors for better diagnosing thymoma or TC. METHODS: We propose a multi-path cross-scale vision transformer (MC-ViT), which first uses the cross attentive scale-aware transformer (CAST) to classify the pathological information related to thymoma, and then uses such pathological information priors to assist the WSIs transformer (WT) for thymoma typing. To make full use of the multi-scale (10×, 20×, and 40×) information inherent in a WSI, CAST not only employs parallel multi-path to capture different receptive field features from multi-scale WSI inputs, but also introduces the cross-correlation attention module (CAM) to aggregate multi-scale features to achieve cross-scale spatial information complementarity. After that, WT can effectively convert full-scale WSIs into 1D feature matrices with pathological information labels to improve the efficiency and accuracy of thymoma typing. RESULTS: We construct a large-scale thymoma histopathology WSI (THW) dataset and annotate corresponding pathological information and thymoma typing labels. The proposed MC-ViT achieves the Top-1 accuracy of 0.939 and 0.951 in pathological information classification and thymoma typing, respectively. Moreover, the quantitative and statistical experiments on the THW dataset also demonstrate that our pipeline performs favorably against the existing classical convolutional neural networks, vision transformers, and deep learning-based medical image classification methods. CONCLUSION: This paper demonstrates that comprehensively utilizing the pathological information contained in multi-scale WSIs is feasible for thymoma typing and achieves clinically acceptable performance. Specifically, the proposed MC-ViT can well predict pathological information classes as well as thymoma types, which show the application potential to the diagnosis of thymoma and TC and may assist doctors in improving diagnosis efficiency and accuracy. Frontiers Media S.A. 2022-10-31 /pmc/articles/PMC9659861/ /pubmed/36387248 http://dx.doi.org/10.3389/fonc.2022.925903 Text en Copyright © 2022 Zhang, Chen, Qin, Wang, Ma, Wang, Zhong and Liu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Zhang, Huaqi
Chen, Huang
Qin, Jin
Wang, Bei
Ma, Guolin
Wang, Pengyu
Zhong, Dingrong
Liu, Jie
MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing
title MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing
title_full MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing
title_fullStr MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing
title_full_unstemmed MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing
title_short MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing
title_sort mc-vit: multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9659861/
https://www.ncbi.nlm.nih.gov/pubmed/36387248
http://dx.doi.org/10.3389/fonc.2022.925903
work_keys_str_mv AT zhanghuaqi mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping
AT chenhuang mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping
AT qinjin mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping
AT wangbei mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping
AT maguolin mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping
AT wangpengyu mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping
AT zhongdingrong mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping
AT liujie mcvitmultipathcrossscalevisiontransformerforthymomahistopathologywholeslideimagetyping