Cargando…

Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification

A comprehensive medical image-based diagnosis is usually performed across various image modalities before passing a final decision; hence, designing a deep learning model that can use any medical image modality to diagnose a particular disease is of great interest. The available methods are multi-st...

Descripción completa

Detalles Bibliográficos
Autores principales: Adeshina, Steve A., Adedigba, Adeyinka P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9311779/
https://www.ncbi.nlm.nih.gov/pubmed/35877363
http://dx.doi.org/10.3390/bioengineering9070312
_version_ 1784753677156745216
author Adeshina, Steve A.
Adedigba, Adeyinka P.
author_facet Adeshina, Steve A.
Adedigba, Adeyinka P.
author_sort Adeshina, Steve A.
collection PubMed
description A comprehensive medical image-based diagnosis is usually performed across various image modalities before passing a final decision; hence, designing a deep learning model that can use any medical image modality to diagnose a particular disease is of great interest. The available methods are multi-staged, with many computational bottlenecks in between. This paper presents an improved end-to-end method of multimodal image classification using deep learning models. We present top research methods developed over the years to improve models trained from scratch and transfer learning approaches. We show that when fully trained, a model can first implicitly discriminate the imaging modality and then diagnose the relevant disease. Our developed models were applied to COVID-19 classification from chest X-ray, CT scan, and lung ultrasound image modalities. The model that achieved the highest accuracy correctly maps all input images to their respective modality, then classifies the disease achieving overall 91.07% accuracy.
format Online
Article
Text
id pubmed-9311779
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93117792022-07-26 Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification Adeshina, Steve A. Adedigba, Adeyinka P. Bioengineering (Basel) Article A comprehensive medical image-based diagnosis is usually performed across various image modalities before passing a final decision; hence, designing a deep learning model that can use any medical image modality to diagnose a particular disease is of great interest. The available methods are multi-staged, with many computational bottlenecks in between. This paper presents an improved end-to-end method of multimodal image classification using deep learning models. We present top research methods developed over the years to improve models trained from scratch and transfer learning approaches. We show that when fully trained, a model can first implicitly discriminate the imaging modality and then diagnose the relevant disease. Our developed models were applied to COVID-19 classification from chest X-ray, CT scan, and lung ultrasound image modalities. The model that achieved the highest accuracy correctly maps all input images to their respective modality, then classifies the disease achieving overall 91.07% accuracy. MDPI 2022-07-13 /pmc/articles/PMC9311779/ /pubmed/35877363 http://dx.doi.org/10.3390/bioengineering9070312 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Adeshina, Steve A.
Adedigba, Adeyinka P.
Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification
title Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification
title_full Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification
title_fullStr Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification
title_full_unstemmed Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification
title_short Bag of Tricks for Improving Deep Learning Performance on Multimodal Image Classification
title_sort bag of tricks for improving deep learning performance on multimodal image classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9311779/
https://www.ncbi.nlm.nih.gov/pubmed/35877363
http://dx.doi.org/10.3390/bioengineering9070312
work_keys_str_mv AT adeshinastevea bagoftricksforimprovingdeeplearningperformanceonmultimodalimageclassification
AT adedigbaadeyinkap bagoftricksforimprovingdeeplearningperformanceonmultimodalimageclassification