Cargando…
Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification
Due to progression in cell-cycle or duration of storage, classification of morphological changes in human blood cells is important for correct and effective clinical decisions. Automated classification systems help avoid subjective outcomes and are more efficient. Deep learning and more specifically...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9613648/ https://www.ncbi.nlm.nih.gov/pubmed/36302948 http://dx.doi.org/10.1038/s41598-022-22882-x |
_version_ | 1784820021517615104 |
---|---|
author | Rana, Priyanka Sowmya, Arcot Meijering, Erik Song, Yang |
author_facet | Rana, Priyanka Sowmya, Arcot Meijering, Erik Song, Yang |
author_sort | Rana, Priyanka |
collection | PubMed |
description | Due to progression in cell-cycle or duration of storage, classification of morphological changes in human blood cells is important for correct and effective clinical decisions. Automated classification systems help avoid subjective outcomes and are more efficient. Deep learning and more specifically Convolutional Neural Networks have achieved state-of-the-art performance on various biomedical image classification problems. However, real-world data often suffers from the data imbalance problem, owing to which the trained classifier is biased towards the majority classes and does not perform well on the minority classes. This study presents an imbalanced blood cells classification method that utilises Wasserstein divergence GAN, mixup and novel nonlinear mixup for data augmentation to achieve oversampling of the minority classes. We also present a minority class focussed sampling strategy, which allows effective representation of minority class samples produced by all three data augmentation techniques and contributes to the classification performance. The method was evaluated on two publicly available datasets of immortalised human T-lymphocyte cells and Red Blood Cells. Classification performance evaluated using F1-score shows that our proposed approach outperforms existing methods on the same datasets. |
format | Online Article Text |
id | pubmed-9613648 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-96136482022-10-29 Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification Rana, Priyanka Sowmya, Arcot Meijering, Erik Song, Yang Sci Rep Article Due to progression in cell-cycle or duration of storage, classification of morphological changes in human blood cells is important for correct and effective clinical decisions. Automated classification systems help avoid subjective outcomes and are more efficient. Deep learning and more specifically Convolutional Neural Networks have achieved state-of-the-art performance on various biomedical image classification problems. However, real-world data often suffers from the data imbalance problem, owing to which the trained classifier is biased towards the majority classes and does not perform well on the minority classes. This study presents an imbalanced blood cells classification method that utilises Wasserstein divergence GAN, mixup and novel nonlinear mixup for data augmentation to achieve oversampling of the minority classes. We also present a minority class focussed sampling strategy, which allows effective representation of minority class samples produced by all three data augmentation techniques and contributes to the classification performance. The method was evaluated on two publicly available datasets of immortalised human T-lymphocyte cells and Red Blood Cells. Classification performance evaluated using F1-score shows that our proposed approach outperforms existing methods on the same datasets. Nature Publishing Group UK 2022-10-27 /pmc/articles/PMC9613648/ /pubmed/36302948 http://dx.doi.org/10.1038/s41598-022-22882-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Rana, Priyanka Sowmya, Arcot Meijering, Erik Song, Yang Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification |
title | Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification |
title_full | Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification |
title_fullStr | Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification |
title_full_unstemmed | Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification |
title_short | Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification |
title_sort | data augmentation with improved regularisation and sampling for imbalanced blood cell image classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9613648/ https://www.ncbi.nlm.nih.gov/pubmed/36302948 http://dx.doi.org/10.1038/s41598-022-22882-x |
work_keys_str_mv | AT ranapriyanka dataaugmentationwithimprovedregularisationandsamplingforimbalancedbloodcellimageclassification AT sowmyaarcot dataaugmentationwithimprovedregularisationandsamplingforimbalancedbloodcellimageclassification AT meijeringerik dataaugmentationwithimprovedregularisationandsamplingforimbalancedbloodcellimageclassification AT songyang dataaugmentationwithimprovedregularisationandsamplingforimbalancedbloodcellimageclassification |