Cargando…

Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures

Medical imaging refers to visualization techniques to provide valuable information about the internal structures of the human body for clinical applications, diagnosis, treatment, and scientific research. Segmentation is one of the primary methods for analyzing and processing medical images, which h...

Descripción completa

Detalles Bibliográficos
Autores principales: Abedalla, Ayat, Abdullah, Malak, Al-Ayyoub, Mahmoud, Benkhelifa, Elhadj
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8279140/
https://www.ncbi.nlm.nih.gov/pubmed/34307860
http://dx.doi.org/10.7717/peerj-cs.607
_version_ 1783722396047376384
author Abedalla, Ayat
Abdullah, Malak
Al-Ayyoub, Mahmoud
Benkhelifa, Elhadj
author_facet Abedalla, Ayat
Abdullah, Malak
Al-Ayyoub, Mahmoud
Benkhelifa, Elhadj
author_sort Abedalla, Ayat
collection PubMed
description Medical imaging refers to visualization techniques to provide valuable information about the internal structures of the human body for clinical applications, diagnosis, treatment, and scientific research. Segmentation is one of the primary methods for analyzing and processing medical images, which helps doctors diagnose accurately by providing detailed information on the body’s required part. However, segmenting medical images faces several challenges, such as requiring trained medical experts and being time-consuming and error-prone. Thus, it appears necessary for an automatic medical image segmentation system. Deep learning algorithms have recently shown outstanding performance for segmentation tasks, especially semantic segmentation networks that provide pixel-level image understanding. By introducing the first fully convolutional network (FCN) for semantic image segmentation, several segmentation networks have been proposed on its basis. One of the state-of-the-art convolutional networks in the medical image field is U-Net. This paper presents a novel end-to-end semantic segmentation model, named Ens4B-UNet, for medical images that ensembles four U-Net architectures with pre-trained backbone networks. Ens4B-UNet utilizes U-Net’s success with several significant improvements by adapting powerful and robust convolutional neural networks (CNNs) as backbones for U-Nets encoders and using the nearest-neighbor up-sampling in the decoders. Ens4B-UNet is designed based on the weighted average ensemble of four encoder-decoder segmentation models. The backbone networks of all ensembled models are pre-trained on the ImageNet dataset to exploit the benefit of transfer learning. For improving our models, we apply several techniques for training and predicting, including stochastic weight averaging (SWA), data augmentation, test-time augmentation (TTA), and different types of optimal thresholds. We evaluate and test our models on the 2019 Pneumothorax Challenge dataset, which contains 12,047 training images with 12,954 masks and 3,205 test images. Our proposed segmentation network achieves a 0.8608 mean Dice similarity coefficient (DSC) on the test set, which is among the top one-percent systems in the Kaggle competition.
format Online
Article
Text
id pubmed-8279140
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-82791402021-07-22 Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures Abedalla, Ayat Abdullah, Malak Al-Ayyoub, Mahmoud Benkhelifa, Elhadj PeerJ Comput Sci Bioinformatics Medical imaging refers to visualization techniques to provide valuable information about the internal structures of the human body for clinical applications, diagnosis, treatment, and scientific research. Segmentation is one of the primary methods for analyzing and processing medical images, which helps doctors diagnose accurately by providing detailed information on the body’s required part. However, segmenting medical images faces several challenges, such as requiring trained medical experts and being time-consuming and error-prone. Thus, it appears necessary for an automatic medical image segmentation system. Deep learning algorithms have recently shown outstanding performance for segmentation tasks, especially semantic segmentation networks that provide pixel-level image understanding. By introducing the first fully convolutional network (FCN) for semantic image segmentation, several segmentation networks have been proposed on its basis. One of the state-of-the-art convolutional networks in the medical image field is U-Net. This paper presents a novel end-to-end semantic segmentation model, named Ens4B-UNet, for medical images that ensembles four U-Net architectures with pre-trained backbone networks. Ens4B-UNet utilizes U-Net’s success with several significant improvements by adapting powerful and robust convolutional neural networks (CNNs) as backbones for U-Nets encoders and using the nearest-neighbor up-sampling in the decoders. Ens4B-UNet is designed based on the weighted average ensemble of four encoder-decoder segmentation models. The backbone networks of all ensembled models are pre-trained on the ImageNet dataset to exploit the benefit of transfer learning. For improving our models, we apply several techniques for training and predicting, including stochastic weight averaging (SWA), data augmentation, test-time augmentation (TTA), and different types of optimal thresholds. We evaluate and test our models on the 2019 Pneumothorax Challenge dataset, which contains 12,047 training images with 12,954 masks and 3,205 test images. Our proposed segmentation network achieves a 0.8608 mean Dice similarity coefficient (DSC) on the test set, which is among the top one-percent systems in the Kaggle competition. PeerJ Inc. 2021-06-29 /pmc/articles/PMC8279140/ /pubmed/34307860 http://dx.doi.org/10.7717/peerj-cs.607 Text en © 2021 Abedalla et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Abedalla, Ayat
Abdullah, Malak
Al-Ayyoub, Mahmoud
Benkhelifa, Elhadj
Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures
title Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures
title_full Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures
title_fullStr Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures
title_full_unstemmed Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures
title_short Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures
title_sort chest x-ray pneumothorax segmentation using u-net with efficientnet and resnet architectures
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8279140/
https://www.ncbi.nlm.nih.gov/pubmed/34307860
http://dx.doi.org/10.7717/peerj-cs.607
work_keys_str_mv AT abedallaayat chestxraypneumothoraxsegmentationusingunetwithefficientnetandresnetarchitectures
AT abdullahmalak chestxraypneumothoraxsegmentationusingunetwithefficientnetandresnetarchitectures
AT alayyoubmahmoud chestxraypneumothoraxsegmentationusingunetwithefficientnetandresnetarchitectures
AT benkhelifaelhadj chestxraypneumothoraxsegmentationusingunetwithefficientnetandresnetarchitectures