Cargando…

Analyzing Transfer Learning of Vision Transformers for Interpreting Chest Radiography

Limited availability of medical imaging datasets is a vital limitation when using “data hungry” deep learning to gain performance improvements. Dealing with the issue, transfer learning has become a de facto standard, where a pre-trained convolution neural network (CNN), typically on natural images...

Descripción completa

Detalles Bibliográficos
Autores principales: Usman, Mohammad, Zia, Tehseen, Tariq, Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9274969/
https://www.ncbi.nlm.nih.gov/pubmed/35819537
http://dx.doi.org/10.1007/s10278-022-00666-z
_version_ 1784745398960652288
author Usman, Mohammad
Zia, Tehseen
Tariq, Ali
author_facet Usman, Mohammad
Zia, Tehseen
Tariq, Ali
author_sort Usman, Mohammad
collection PubMed
description Limited availability of medical imaging datasets is a vital limitation when using “data hungry” deep learning to gain performance improvements. Dealing with the issue, transfer learning has become a de facto standard, where a pre-trained convolution neural network (CNN), typically on natural images (e.g., ImageNet), is finetuned on medical images. Meanwhile, pre-trained transformers, which are self-attention-based models, have become de facto standard in natural language processing (NLP) and state of the art in image classification due to their powerful transfer learning abilities. Inspired by the success of transformers in NLP and image classification, large-scale transformers (such as vision transformer) are trained on natural images. Based on these recent developments, this research aims to explore the efficacy of pre-trained natural image transformers for medical images. Specifically, we analyze pre-trained vision transformer on CheXpert and pediatric pneumonia dataset. We use CNN standard models including VGGNet and ResNet as baseline models. By examining the acquired representations and results, we discover that transfer learning from the pre-trained vision transformer shows improved results as compared to pre-trained CNN which demonstrates a greater transfer ability of the transformers in medical imaging. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10278-022-00666-z.
format Online
Article
Text
id pubmed-9274969
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-92749692022-07-14 Analyzing Transfer Learning of Vision Transformers for Interpreting Chest Radiography Usman, Mohammad Zia, Tehseen Tariq, Ali J Digit Imaging Original Paper Limited availability of medical imaging datasets is a vital limitation when using “data hungry” deep learning to gain performance improvements. Dealing with the issue, transfer learning has become a de facto standard, where a pre-trained convolution neural network (CNN), typically on natural images (e.g., ImageNet), is finetuned on medical images. Meanwhile, pre-trained transformers, which are self-attention-based models, have become de facto standard in natural language processing (NLP) and state of the art in image classification due to their powerful transfer learning abilities. Inspired by the success of transformers in NLP and image classification, large-scale transformers (such as vision transformer) are trained on natural images. Based on these recent developments, this research aims to explore the efficacy of pre-trained natural image transformers for medical images. Specifically, we analyze pre-trained vision transformer on CheXpert and pediatric pneumonia dataset. We use CNN standard models including VGGNet and ResNet as baseline models. By examining the acquired representations and results, we discover that transfer learning from the pre-trained vision transformer shows improved results as compared to pre-trained CNN which demonstrates a greater transfer ability of the transformers in medical imaging. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10278-022-00666-z. Springer International Publishing 2022-07-11 2022-12 /pmc/articles/PMC9274969/ /pubmed/35819537 http://dx.doi.org/10.1007/s10278-022-00666-z Text en © The Author(s) under exclusive licence to Society for Imaging Informatics in Medicine 2022
spellingShingle Original Paper
Usman, Mohammad
Zia, Tehseen
Tariq, Ali
Analyzing Transfer Learning of Vision Transformers for Interpreting Chest Radiography
title Analyzing Transfer Learning of Vision Transformers for Interpreting Chest Radiography
title_full Analyzing Transfer Learning of Vision Transformers for Interpreting Chest Radiography
title_fullStr Analyzing Transfer Learning of Vision Transformers for Interpreting Chest Radiography
title_full_unstemmed Analyzing Transfer Learning of Vision Transformers for Interpreting Chest Radiography
title_short Analyzing Transfer Learning of Vision Transformers for Interpreting Chest Radiography
title_sort analyzing transfer learning of vision transformers for interpreting chest radiography
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9274969/
https://www.ncbi.nlm.nih.gov/pubmed/35819537
http://dx.doi.org/10.1007/s10278-022-00666-z
work_keys_str_mv AT usmanmohammad analyzingtransferlearningofvisiontransformersforinterpretingchestradiography
AT ziatehseen analyzingtransferlearningofvisiontransformersforinterpretingchestradiography
AT tariqali analyzingtransferlearningofvisiontransformersforinterpretingchestradiography