Cargando…
MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis
Medical image analysis continues to hold interesting challenges given the subtle characteristics of certain diseases and the significant overlap in appearance between diseases. In this study, we explore the concept of self-attention for tackling such subtleties in and between diseases. To this end,...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8886730/ https://www.ncbi.nlm.nih.gov/pubmed/35242769 http://dx.doi.org/10.3389/fmed.2021.821120 |
_version_ | 1784660741292294144 |
---|---|
author | Aboutalebi, Hossein Pavlova, Maya Gunraj, Hayden Shafiee, Mohammad Javad Sabri, Ali Alaref, Amer Wong, Alexander |
author_facet | Aboutalebi, Hossein Pavlova, Maya Gunraj, Hayden Shafiee, Mohammad Javad Sabri, Ali Alaref, Amer Wong, Alexander |
author_sort | Aboutalebi, Hossein |
collection | PubMed |
description | Medical image analysis continues to hold interesting challenges given the subtle characteristics of certain diseases and the significant overlap in appearance between diseases. In this study, we explore the concept of self-attention for tackling such subtleties in and between diseases. To this end, we introduce, a multi-scale encoder-decoder self-attention (MEDUSA) mechanism tailored for medical image analysis. While self-attention deep convolutional neural network architectures in existing literature center around the notion of multiple isolated lightweight attention mechanisms with limited individual capacities being incorporated at different points in the network architecture, MEDUSA takes a significant departure from this notion by possessing a single, unified self-attention mechanism with significantly higher capacity with multiple attention heads feeding into different scales in the network architecture. To the best of the authors' knowledge, this is the first “single body, multi-scale heads” realization of self-attention and enables explicit global context among selective attention at different levels of representational abstractions while still enabling differing local attention context at individual levels of abstractions. With MEDUSA, we obtain state-of-the-art performance on multiple challenging medical image analysis benchmarks including COVIDx, Radiological Society of North America (RSNA) RICORD, and RSNA Pneumonia Challenge when compared to previous work. Our MEDUSA model is publicly available. |
format | Online Article Text |
id | pubmed-8886730 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88867302022-03-02 MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis Aboutalebi, Hossein Pavlova, Maya Gunraj, Hayden Shafiee, Mohammad Javad Sabri, Ali Alaref, Amer Wong, Alexander Front Med (Lausanne) Medicine Medical image analysis continues to hold interesting challenges given the subtle characteristics of certain diseases and the significant overlap in appearance between diseases. In this study, we explore the concept of self-attention for tackling such subtleties in and between diseases. To this end, we introduce, a multi-scale encoder-decoder self-attention (MEDUSA) mechanism tailored for medical image analysis. While self-attention deep convolutional neural network architectures in existing literature center around the notion of multiple isolated lightweight attention mechanisms with limited individual capacities being incorporated at different points in the network architecture, MEDUSA takes a significant departure from this notion by possessing a single, unified self-attention mechanism with significantly higher capacity with multiple attention heads feeding into different scales in the network architecture. To the best of the authors' knowledge, this is the first “single body, multi-scale heads” realization of self-attention and enables explicit global context among selective attention at different levels of representational abstractions while still enabling differing local attention context at individual levels of abstractions. With MEDUSA, we obtain state-of-the-art performance on multiple challenging medical image analysis benchmarks including COVIDx, Radiological Society of North America (RSNA) RICORD, and RSNA Pneumonia Challenge when compared to previous work. Our MEDUSA model is publicly available. Frontiers Media S.A. 2022-02-15 /pmc/articles/PMC8886730/ /pubmed/35242769 http://dx.doi.org/10.3389/fmed.2021.821120 Text en Copyright © 2022 Aboutalebi, Pavlova, Gunraj, Shafiee, Sabri, Alaref and Wong. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Medicine Aboutalebi, Hossein Pavlova, Maya Gunraj, Hayden Shafiee, Mohammad Javad Sabri, Ali Alaref, Amer Wong, Alexander MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis |
title | MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis |
title_full | MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis |
title_fullStr | MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis |
title_full_unstemmed | MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis |
title_short | MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis |
title_sort | medusa: multi-scale encoder-decoder self-attention deep neural network architecture for medical image analysis |
topic | Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8886730/ https://www.ncbi.nlm.nih.gov/pubmed/35242769 http://dx.doi.org/10.3389/fmed.2021.821120 |
work_keys_str_mv | AT aboutalebihossein medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis AT pavlovamaya medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis AT gunrajhayden medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis AT shafieemohammadjavad medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis AT sabriali medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis AT alarefamer medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis AT wongalexander medusamultiscaleencoderdecoderselfattentiondeepneuralnetworkarchitectureformedicalimageanalysis |