Cargando…
Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification
A comprehensive medical image-based diagnosis is usually performed across various image modalities before passing a final decision; hence, designing a deep learning model that can use any medical image modality to diagnose a particular disease is of great interest. The available methods are multi-st...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9311779/ https://www.ncbi.nlm.nih.gov/pubmed/35877363 http://dx.doi.org/10.3390/bioengineering9070312 |
_version_ | 1784753677156745216 |
---|---|
author | Adeshina, Steve A. Adedigba, Adeyinka P. |
author_facet | Adeshina, Steve A. Adedigba, Adeyinka P. |
author_sort | Adeshina, Steve A. |
collection | PubMed |
description | A comprehensive medical image-based diagnosis is usually performed across various image modalities before passing a final decision; hence, designing a deep learning model that can use any medical image modality to diagnose a particular disease is of great interest. The available methods are multi-staged, with many computational bottlenecks in between. This paper presents an improved end-to-end method of multimodal image classification using deep learning models. We present top research methods developed over the years to improve models trained from scratch and transfer learning approaches. We show that when fully trained, a model can first implicitly discriminate the imaging modality and then diagnose the relevant disease. Our developed models were applied to COVID-19 classification from chest X-ray, CT scan, and lung ultrasound image modalities. The model that achieved the highest accuracy correctly maps all input images to their respective modality, then classifies the disease achieving overall 91.07% accuracy. |
format | Online Article Text |
id | pubmed-9311779 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-93117792022-07-26 Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification Adeshina, Steve A. Adedigba, Adeyinka P. Bioengineering (Basel) Article A comprehensive medical image-based diagnosis is usually performed across various image modalities before passing a final decision; hence, designing a deep learning model that can use any medical image modality to diagnose a particular disease is of great interest. The available methods are multi-staged, with many computational bottlenecks in between. This paper presents an improved end-to-end method of multimodal image classification using deep learning models. We present top research methods developed over the years to improve models trained from scratch and transfer learning approaches. We show that when fully trained, a model can first implicitly discriminate the imaging modality and then diagnose the relevant disease. Our developed models were applied to COVID-19 classification from chest X-ray, CT scan, and lung ultrasound image modalities. The model that achieved the highest accuracy correctly maps all input images to their respective modality, then classifies the disease achieving overall 91.07% accuracy. MDPI 2022-07-13 /pmc/articles/PMC9311779/ /pubmed/35877363 http://dx.doi.org/10.3390/bioengineering9070312 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Adeshina, Steve A. Adedigba, Adeyinka P. Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification |
title | Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification |
title_full | Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification |
title_fullStr | Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification |
title_full_unstemmed | Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification |
title_short | Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification |
title_sort | bag of tricks for improving deep learning performance on multimodal image classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9311779/ https://www.ncbi.nlm.nih.gov/pubmed/35877363 http://dx.doi.org/10.3390/bioengineering9070312 |
work_keys_str_mv | AT adeshinastevea bagoftricksforimprovingdeeplearningperformanceonmultimodalimageclassification AT adedigbaadeyinkap bagoftricksforimprovingdeeplearningperformanceonmultimodalimageclassification |