Cargando…

Dynamic deformable attention network (DDANet) for COVID-19 lesions semantic segmentation

Deep learning based medical image segmentation is an important step within diagnosis, which relies strongly on capturing sufficient spatial context without requiring too complex models that are hard to train with limited labelled data. Training data is in particular scarce for segmenting infection r...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rajamani, Kumar T., Siebert, Hanna, Heinrich, Mattias P.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier Inc. 2021
Materias:	Original Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9246608/ https://www.ncbi.nlm.nih.gov/pubmed/34022421 http://dx.doi.org/10.1016/j.jbi.2021.103816

_version_	1784739003518418944
author	Rajamani, Kumar T. Siebert, Hanna Heinrich, Mattias P.
author_facet	Rajamani, Kumar T. Siebert, Hanna Heinrich, Mattias P.
author_sort	Rajamani, Kumar T.
collection	PubMed
description	Deep learning based medical image segmentation is an important step within diagnosis, which relies strongly on capturing sufficient spatial context without requiring too complex models that are hard to train with limited labelled data. Training data is in particular scarce for segmenting infection regions of CT images of COVID-19 patients. Attention models help gather contextual information within deep networks and benefit semantic segmentation tasks. The recent criss-cross-attention module aims to approximate global self-attention while remaining memory and time efficient by separating horizontal and vertical self-similarity computations. However, capturing attention from all non-local locations can adversely impact the accuracy of semantic segmentation networks. We propose a new Dynamic Deformable Attention Network (DDANet) that enables a more accurate contextual information computation in a similarly efficient way. Our novel technique is based on a deformable criss-cross attention block that learns both attention coefficients and attention offsets in a continuous way. A deep U-Net (Schlemper et al., 2019) segmentation network that employs this attention mechanism is able to capture attention from pertinent non-local locations and also improves the performance on semantic segmentation tasks compared to criss-cross attention within a U-Net on a challenging COVID-19 lesion segmentation task. Our validation experiments show that the performance gain of the recursively applied dynamic deformable attention blocks comes from their ability to capture dynamic and precise attention context. Our DDANet achieves Dice scores of 73.4% and 61.3% for Ground-glass opacity and consolidation lesions for COVID-19 segmentation and improves the accuracy by 4.9% points compared to a baseline U-Net and 24.4% points compared to current state of art methods (Fan et al., 2020).
format	Online Article Text
id	pubmed-9246608
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Elsevier Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-92466082022-07-01 Dynamic deformable attention network (DDANet) for COVID-19 lesions semantic segmentation Rajamani, Kumar T. Siebert, Hanna Heinrich, Mattias P. J Biomed Inform Original Research Deep learning based medical image segmentation is an important step within diagnosis, which relies strongly on capturing sufficient spatial context without requiring too complex models that are hard to train with limited labelled data. Training data is in particular scarce for segmenting infection regions of CT images of COVID-19 patients. Attention models help gather contextual information within deep networks and benefit semantic segmentation tasks. The recent criss-cross-attention module aims to approximate global self-attention while remaining memory and time efficient by separating horizontal and vertical self-similarity computations. However, capturing attention from all non-local locations can adversely impact the accuracy of semantic segmentation networks. We propose a new Dynamic Deformable Attention Network (DDANet) that enables a more accurate contextual information computation in a similarly efficient way. Our novel technique is based on a deformable criss-cross attention block that learns both attention coefficients and attention offsets in a continuous way. A deep U-Net (Schlemper et al., 2019) segmentation network that employs this attention mechanism is able to capture attention from pertinent non-local locations and also improves the performance on semantic segmentation tasks compared to criss-cross attention within a U-Net on a challenging COVID-19 lesion segmentation task. Our validation experiments show that the performance gain of the recursively applied dynamic deformable attention blocks comes from their ability to capture dynamic and precise attention context. Our DDANet achieves Dice scores of 73.4% and 61.3% for Ground-glass opacity and consolidation lesions for COVID-19 segmentation and improves the accuracy by 4.9% points compared to a baseline U-Net and 24.4% points compared to current state of art methods (Fan et al., 2020). Elsevier Inc. 2021-07 2021-05-20 /pmc/articles/PMC9246608/ /pubmed/34022421 http://dx.doi.org/10.1016/j.jbi.2021.103816 Text en © 2021 Elsevier Inc. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle	Original Research Rajamani, Kumar T. Siebert, Hanna Heinrich, Mattias P. Dynamic deformable attention network (DDANet) for COVID-19 lesions semantic segmentation
title	Dynamic deformable attention network (DDANet) for COVID-19 lesions semantic segmentation
title_full	Dynamic deformable attention network (DDANet) for COVID-19 lesions semantic segmentation
title_fullStr	Dynamic deformable attention network (DDANet) for COVID-19 lesions semantic segmentation
title_full_unstemmed	Dynamic deformable attention network (DDANet) for COVID-19 lesions semantic segmentation
title_short	Dynamic deformable attention network (DDANet) for COVID-19 lesions semantic segmentation
title_sort	dynamic deformable attention network (ddanet) for covid-19 lesions semantic segmentation
topic	Original Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9246608/ https://www.ncbi.nlm.nih.gov/pubmed/34022421 http://dx.doi.org/10.1016/j.jbi.2021.103816
work_keys_str_mv	AT rajamanikumart dynamicdeformableattentionnetworkddanetforcovid19lesionssemanticsegmentation AT sieberthanna dynamicdeformableattentionnetworkddanetforcovid19lesionssemanticsegmentation AT heinrichmattiasp dynamicdeformableattentionnetworkddanetforcovid19lesionssemanticsegmentation

Dynamic deformable attention network (DDANet) for COVID-19 lesions semantic segmentation

Ejemplares similares