Cargando…
Dilated transformer: residual axial attention for breast ultrasound image segmentation
BACKGROUND: The segmentation of breast ultrasound (US) images has been a challenging task, mainly due to limited data and the inherent image characteristics involved, such as low contrast and speckle noise. Although convolutional neural network-based (CNN-based) methods have made significant progres...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9403584/ https://www.ncbi.nlm.nih.gov/pubmed/36060605 http://dx.doi.org/10.21037/qims-22-33 |
_version_ | 1784773409986576384 |
---|---|
author | Shen, Xiaoyan Wang, Liangyu Zhao, Yu Liu, Ruibo Qian, Wei Ma, He |
author_facet | Shen, Xiaoyan Wang, Liangyu Zhao, Yu Liu, Ruibo Qian, Wei Ma, He |
author_sort | Shen, Xiaoyan |
collection | PubMed |
description | BACKGROUND: The segmentation of breast ultrasound (US) images has been a challenging task, mainly due to limited data and the inherent image characteristics involved, such as low contrast and speckle noise. Although convolutional neural network-based (CNN-based) methods have made significant progress over the past decade, they lack the ability to model long-range interactions. Recently, the transformer method has been successfully applied to the tasks of computer vision. It has a strong ability to capture distant interactions. However, most transformer-based methods with excellent performance rely on pre-training on large datasets, making it infeasible to directly apply them to medical images analysis, especially that of breast US images with limited high-quality labels. Therefore, it is of great significance to find a robust and efficient transformer-based method for use on small breast US image datasets. METHODS: We developed a dilated transformer (DT) method which mainly uses the proposed residual axial attention layers to build encoder blocks and the introduced dilation module (DM) to further increase the receptive field. We evaluated the proposed method on 2 breast US image datasets using the 5-fold cross-validation method. Dataset A was a public dataset with 562 images, while dataset B was a private dataset with 878 images. Ground truth (GT) was delineated by 2 radiologists with more than 5 years of experience. The evaluation was followed by related ablation experiments. RESULTS: The DT was found to be comparable with the state-of-the-art (SOTA) CNN-based method and outperformed the related transformer-based method, medical transformer (MT), on both datasets. Especially on dataset B, the DT outperformed the MT on metrics of Jaccard index (JI) and Dice similarity coefficient (DSC) by 2.67% and 4.68%, respectively. Meanwhile, when compared with Unet, the DT improved JI and DSC by 4.89% and 4.66%, respectively. Moreover, the results of the ablation experiments showed that each add-on part of the DT is important and contributes to the segmentation accuracy. CONCLUSIONS: The proposed transformer-based method could achieve advanced segmentation performance on different small breast US image datasets without pretraining. |
format | Online Article Text |
id | pubmed-9403584 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-94035842022-09-01 Dilated transformer: residual axial attention for breast ultrasound image segmentation Shen, Xiaoyan Wang, Liangyu Zhao, Yu Liu, Ruibo Qian, Wei Ma, He Quant Imaging Med Surg Original Article BACKGROUND: The segmentation of breast ultrasound (US) images has been a challenging task, mainly due to limited data and the inherent image characteristics involved, such as low contrast and speckle noise. Although convolutional neural network-based (CNN-based) methods have made significant progress over the past decade, they lack the ability to model long-range interactions. Recently, the transformer method has been successfully applied to the tasks of computer vision. It has a strong ability to capture distant interactions. However, most transformer-based methods with excellent performance rely on pre-training on large datasets, making it infeasible to directly apply them to medical images analysis, especially that of breast US images with limited high-quality labels. Therefore, it is of great significance to find a robust and efficient transformer-based method for use on small breast US image datasets. METHODS: We developed a dilated transformer (DT) method which mainly uses the proposed residual axial attention layers to build encoder blocks and the introduced dilation module (DM) to further increase the receptive field. We evaluated the proposed method on 2 breast US image datasets using the 5-fold cross-validation method. Dataset A was a public dataset with 562 images, while dataset B was a private dataset with 878 images. Ground truth (GT) was delineated by 2 radiologists with more than 5 years of experience. The evaluation was followed by related ablation experiments. RESULTS: The DT was found to be comparable with the state-of-the-art (SOTA) CNN-based method and outperformed the related transformer-based method, medical transformer (MT), on both datasets. Especially on dataset B, the DT outperformed the MT on metrics of Jaccard index (JI) and Dice similarity coefficient (DSC) by 2.67% and 4.68%, respectively. Meanwhile, when compared with Unet, the DT improved JI and DSC by 4.89% and 4.66%, respectively. Moreover, the results of the ablation experiments showed that each add-on part of the DT is important and contributes to the segmentation accuracy. CONCLUSIONS: The proposed transformer-based method could achieve advanced segmentation performance on different small breast US image datasets without pretraining. AME Publishing Company 2022-09 /pmc/articles/PMC9403584/ /pubmed/36060605 http://dx.doi.org/10.21037/qims-22-33 Text en 2022 Quantitative Imaging in Medicine and Surgery. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Shen, Xiaoyan Wang, Liangyu Zhao, Yu Liu, Ruibo Qian, Wei Ma, He Dilated transformer: residual axial attention for breast ultrasound image segmentation |
title | Dilated transformer: residual axial attention for breast ultrasound image segmentation |
title_full | Dilated transformer: residual axial attention for breast ultrasound image segmentation |
title_fullStr | Dilated transformer: residual axial attention for breast ultrasound image segmentation |
title_full_unstemmed | Dilated transformer: residual axial attention for breast ultrasound image segmentation |
title_short | Dilated transformer: residual axial attention for breast ultrasound image segmentation |
title_sort | dilated transformer: residual axial attention for breast ultrasound image segmentation |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9403584/ https://www.ncbi.nlm.nih.gov/pubmed/36060605 http://dx.doi.org/10.21037/qims-22-33 |
work_keys_str_mv | AT shenxiaoyan dilatedtransformerresidualaxialattentionforbreastultrasoundimagesegmentation AT wangliangyu dilatedtransformerresidualaxialattentionforbreastultrasoundimagesegmentation AT zhaoyu dilatedtransformerresidualaxialattentionforbreastultrasoundimagesegmentation AT liuruibo dilatedtransformerresidualaxialattentionforbreastultrasoundimagesegmentation AT qianwei dilatedtransformerresidualaxialattentionforbreastultrasoundimagesegmentation AT mahe dilatedtransformerresidualaxialattentionforbreastultrasoundimagesegmentation |