Cargando…

MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis

Medical image analysis continues to hold interesting challenges given the subtle characteristics of certain diseases and the significant overlap in appearance between diseases. In this study, we explore the concept of self-attention for tackling such subtleties in and between diseases. To this end,...

Descripción completa

Detalles Bibliográficos
Autores principales: Aboutalebi, Hossein, Pavlova, Maya, Gunraj, Hayden, Shafiee, Mohammad Javad, Sabri, Ali, Alaref, Amer, Wong, Alexander
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8886730/
https://www.ncbi.nlm.nih.gov/pubmed/35242769
http://dx.doi.org/10.3389/fmed.2021.821120
_version_ 1784660741292294144
author Aboutalebi, Hossein
Pavlova, Maya
Gunraj, Hayden
Shafiee, Mohammad Javad
Sabri, Ali
Alaref, Amer
Wong, Alexander
author_facet Aboutalebi, Hossein
Pavlova, Maya
Gunraj, Hayden
Shafiee, Mohammad Javad
Sabri, Ali
Alaref, Amer
Wong, Alexander
author_sort Aboutalebi, Hossein
collection PubMed
description Medical image analysis continues to hold interesting challenges given the subtle characteristics of certain diseases and the significant overlap in appearance between diseases. In this study, we explore the concept of self-attention for tackling such subtleties in and between diseases. To this end, we introduce, a multi-scale encoder-decoder self-attention (MEDUSA) mechanism tailored for medical image analysis. While self-attention deep convolutional neural network architectures in existing literature center around the notion of multiple isolated lightweight attention mechanisms with limited individual capacities being incorporated at different points in the network architecture, MEDUSA takes a significant departure from this notion by possessing a single, unified self-attention mechanism with significantly higher capacity with multiple attention heads feeding into different scales in the network architecture. To the best of the authors' knowledge, this is the first “single body, multi-scale heads” realization of self-attention and enables explicit global context among selective attention at different levels of representational abstractions while still enabling differing local attention context at individual levels of abstractions. With MEDUSA, we obtain state-of-the-art performance on multiple challenging medical image analysis benchmarks including COVIDx, Radiological Society of North America (RSNA) RICORD, and RSNA Pneumonia Challenge when compared to previous work. Our MEDUSA model is publicly available.
format Online
Article
Text
id pubmed-8886730
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88867302022-03-02 MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis Aboutalebi, Hossein Pavlova, Maya Gunraj, Hayden Shafiee, Mohammad Javad Sabri, Ali Alaref, Amer Wong, Alexander Front Med (Lausanne) Medicine Medical image analysis continues to hold interesting challenges given the subtle characteristics of certain diseases and the significant overlap in appearance between diseases. In this study, we explore the concept of self-attention for tackling such subtleties in and between diseases. To this end, we introduce, a multi-scale encoder-decoder self-attention (MEDUSA) mechanism tailored for medical image analysis. While self-attention deep convolutional neural network architectures in existing literature center around the notion of multiple isolated lightweight attention mechanisms with limited individual capacities being incorporated at different points in the network architecture, MEDUSA takes a significant departure from this notion by possessing a single, unified self-attention mechanism with significantly higher capacity with multiple attention heads feeding into different scales in the network architecture. To the best of the authors' knowledge, this is the first “single body, multi-scale heads” realization of self-attention and enables explicit global context among selective attention at different levels of representational abstractions while still enabling differing local attention context at individual levels of abstractions. With MEDUSA, we obtain state-of-the-art performance on multiple challenging medical image analysis benchmarks including COVIDx, Radiological Society of North America (RSNA) RICORD, and RSNA Pneumonia Challenge when compared to previous work. Our MEDUSA model is publicly available. Frontiers Media S.A. 2022-02-15 /pmc/articles/PMC8886730/ /pubmed/35242769 http://dx.doi.org/10.3389/fmed.2021.821120 Text en Copyright © 2022 Aboutalebi, Pavlova, Gunraj, Shafiee, Sabri, Alaref and Wong. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Aboutalebi, Hossein
Pavlova, Maya
Gunraj, Hayden
Shafiee, Mohammad Javad
Sabri, Ali
Alaref, Amer
Wong, Alexander
MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis
title MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis
title_full MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis
title_fullStr MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis
title_full_unstemmed MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis
title_short MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis
title_sort medusa: multi-scale encoder-decoder self-attention deep neural network architecture for medical image analysis
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8886730/
https://www.ncbi.nlm.nih.gov/pubmed/35242769
http://dx.doi.org/10.3389/fmed.2021.821120
work_keys_str_mv AT aboutalebihossein medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis
AT pavlovamaya medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis
AT gunrajhayden medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis
AT shafieemohammadjavad medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis
AT sabriali medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis
AT alarefamer medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis
AT wongalexander medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis