Cargando…
Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia
The most popular test for pneumonia, a serious health threat to children, is chest X-ray imaging. However, the diagnosis of pneumonia relies on the expertise of experienced radiologists, and the scarcity of medical resources has forced us to conduct research on CAD (computer-aided diagnosis). In thi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9239806/ https://www.ncbi.nlm.nih.gov/pubmed/35774299 http://dx.doi.org/10.1155/2022/7852958 |
_version_ | 1784737387891392512 |
---|---|
author | Jiang, Zheng Chen, Liang |
author_facet | Jiang, Zheng Chen, Liang |
author_sort | Jiang, Zheng |
collection | PubMed |
description | The most popular test for pneumonia, a serious health threat to children, is chest X-ray imaging. However, the diagnosis of pneumonia relies on the expertise of experienced radiologists, and the scarcity of medical resources has forced us to conduct research on CAD (computer-aided diagnosis). In this study, we propose MP-ViT, the Multisemantic Level Patch Merger Vision Transformer, to achieve automatic diagnosis of pneumonia in chest X-ray images. We introduce Patch Merger to reduce the computational cost of ViT. Meanwhile, the intermediate results calculated by Patch Merger participate in the final classification in a concise way, so as to make full use of the intermediate information of the high-level semantic space to learn from local to overall and to avoid information loss caused by Patch Merger. We conducted experiments on a dataset with 3,883 chest X-ray images described as pneumonia and 1,349 images labeled as normal, and the results show that even without pretraining ViT on a large dataset, our model can achieve the accuracy of 0.91, the precision of 0.92, the recall of 0.89, and the F1-score of 0.90, which is better than Patch Merger on a small dataset. The model can provide CAD for physicians and improve diagnostic reliability. |
format | Online Article Text |
id | pubmed-9239806 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-92398062022-06-29 Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia Jiang, Zheng Chen, Liang Comput Math Methods Med Research Article The most popular test for pneumonia, a serious health threat to children, is chest X-ray imaging. However, the diagnosis of pneumonia relies on the expertise of experienced radiologists, and the scarcity of medical resources has forced us to conduct research on CAD (computer-aided diagnosis). In this study, we propose MP-ViT, the Multisemantic Level Patch Merger Vision Transformer, to achieve automatic diagnosis of pneumonia in chest X-ray images. We introduce Patch Merger to reduce the computational cost of ViT. Meanwhile, the intermediate results calculated by Patch Merger participate in the final classification in a concise way, so as to make full use of the intermediate information of the high-level semantic space to learn from local to overall and to avoid information loss caused by Patch Merger. We conducted experiments on a dataset with 3,883 chest X-ray images described as pneumonia and 1,349 images labeled as normal, and the results show that even without pretraining ViT on a large dataset, our model can achieve the accuracy of 0.91, the precision of 0.92, the recall of 0.89, and the F1-score of 0.90, which is better than Patch Merger on a small dataset. The model can provide CAD for physicians and improve diagnostic reliability. Hindawi 2022-06-21 /pmc/articles/PMC9239806/ /pubmed/35774299 http://dx.doi.org/10.1155/2022/7852958 Text en Copyright © 2022 Zheng Jiang and Liang Chen. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Jiang, Zheng Chen, Liang Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia |
title | Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia |
title_full | Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia |
title_fullStr | Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia |
title_full_unstemmed | Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia |
title_short | Multisemantic Level Patch Merger Vision Transformer for Diagnosis of Pneumonia |
title_sort | multisemantic level patch merger vision transformer for diagnosis of pneumonia |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9239806/ https://www.ncbi.nlm.nih.gov/pubmed/35774299 http://dx.doi.org/10.1155/2022/7852958 |
work_keys_str_mv | AT jiangzheng multisemanticlevelpatchmergervisiontransformerfordiagnosisofpneumonia AT chenliang multisemanticlevelpatchmergervisiontransformerfordiagnosisofpneumonia |