Cargando…

Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia

The most popular test for pneumonia, a serious health threat to children, is chest X-ray imaging. However, the diagnosis of pneumonia relies on the expertise of experienced radiologists, and the scarcity of medical resources has forced us to conduct research on CAD (computer-aided diagnosis). In thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Zheng, Chen, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9239806/
https://www.ncbi.nlm.nih.gov/pubmed/35774299
http://dx.doi.org/10.1155/2022/7852958
_version_ 1784737387891392512
author Jiang, Zheng
Chen, Liang
author_facet Jiang, Zheng
Chen, Liang
author_sort Jiang, Zheng
collection PubMed
description The most popular test for pneumonia, a serious health threat to children, is chest X-ray imaging. However, the diagnosis of pneumonia relies on the expertise of experienced radiologists, and the scarcity of medical resources has forced us to conduct research on CAD (computer-aided diagnosis). In this study, we propose MP-ViT, the Multisemantic Level Patch Merger Vision Transformer, to achieve automatic diagnosis of pneumonia in chest X-ray images. We introduce Patch Merger to reduce the computational cost of ViT. Meanwhile, the intermediate results calculated by Patch Merger participate in the final classification in a concise way, so as to make full use of the intermediate information of the high-level semantic space to learn from local to overall and to avoid information loss caused by Patch Merger. We conducted experiments on a dataset with 3,883 chest X-ray images described as pneumonia and 1,349 images labeled as normal, and the results show that even without pretraining ViT on a large dataset, our model can achieve the accuracy of 0.91, the precision of 0.92, the recall of 0.89, and the F1-score of 0.90, which is better than Patch Merger on a small dataset. The model can provide CAD for physicians and improve diagnostic reliability.
format Online
Article
Text
id pubmed-9239806
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-92398062022-06-29 Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia Jiang, Zheng Chen, Liang Comput Math Methods Med Research Article The most popular test for pneumonia, a serious health threat to children, is chest X-ray imaging. However, the diagnosis of pneumonia relies on the expertise of experienced radiologists, and the scarcity of medical resources has forced us to conduct research on CAD (computer-aided diagnosis). In this study, we propose MP-ViT, the Multisemantic Level Patch Merger Vision Transformer, to achieve automatic diagnosis of pneumonia in chest X-ray images. We introduce Patch Merger to reduce the computational cost of ViT. Meanwhile, the intermediate results calculated by Patch Merger participate in the final classification in a concise way, so as to make full use of the intermediate information of the high-level semantic space to learn from local to overall and to avoid information loss caused by Patch Merger. We conducted experiments on a dataset with 3,883 chest X-ray images described as pneumonia and 1,349 images labeled as normal, and the results show that even without pretraining ViT on a large dataset, our model can achieve the accuracy of 0.91, the precision of 0.92, the recall of 0.89, and the F1-score of 0.90, which is better than Patch Merger on a small dataset. The model can provide CAD for physicians and improve diagnostic reliability. Hindawi 2022-06-21 /pmc/articles/PMC9239806/ /pubmed/35774299 http://dx.doi.org/10.1155/2022/7852958 Text en Copyright © 2022 Zheng Jiang and Liang Chen. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jiang, Zheng
Chen, Liang
Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia
title Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia
title_full Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia
title_fullStr Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia
title_full_unstemmed Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia
title_short Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia
title_sort multisemantic level patch merger vision transformer for diagnosis of pneumonia
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9239806/
https://www.ncbi.nlm.nih.gov/pubmed/35774299
http://dx.doi.org/10.1155/2022/7852958
work_keys_str_mv AT jiangzheng multisemanticlevelpatchmergervisiontransformerfordiagnosisofpneumonia
AT chenliang multisemanticlevelpatchmergervisiontransformerfordiagnosisofpneumonia