Cargando…

Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification

Due to progression in cell-cycle or duration of storage, classification of morphological changes in human blood cells is important for correct and effective clinical decisions. Automated classification systems help avoid subjective outcomes and are more efficient. Deep learning and more specifically...

Descripción completa

Detalles Bibliográficos
Autores principales: Rana, Priyanka, Sowmya, Arcot, Meijering, Erik, Song, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9613648/
https://www.ncbi.nlm.nih.gov/pubmed/36302948
http://dx.doi.org/10.1038/s41598-022-22882-x
_version_ 1784820021517615104
author Rana, Priyanka
Sowmya, Arcot
Meijering, Erik
Song, Yang
author_facet Rana, Priyanka
Sowmya, Arcot
Meijering, Erik
Song, Yang
author_sort Rana, Priyanka
collection PubMed
description Due to progression in cell-cycle or duration of storage, classification of morphological changes in human blood cells is important for correct and effective clinical decisions. Automated classification systems help avoid subjective outcomes and are more efficient. Deep learning and more specifically Convolutional Neural Networks have achieved state-of-the-art performance on various biomedical image classification problems. However, real-world data often suffers from the data imbalance problem, owing to which the trained classifier is biased towards the majority classes and does not perform well on the minority classes. This study presents an imbalanced blood cells classification method that utilises Wasserstein divergence GAN, mixup and novel nonlinear mixup for data augmentation to achieve oversampling of the minority classes. We also present a minority class focussed sampling strategy, which allows effective representation of minority class samples produced by all three data augmentation techniques and contributes to the classification performance. The method was evaluated on two publicly available datasets of immortalised human T-lymphocyte cells and Red Blood Cells. Classification performance evaluated using F1-score shows that our proposed approach outperforms existing methods on the same datasets.
format Online
Article
Text
id pubmed-9613648
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96136482022-10-29 Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification Rana, Priyanka Sowmya, Arcot Meijering, Erik Song, Yang Sci Rep Article Due to progression in cell-cycle or duration of storage, classification of morphological changes in human blood cells is important for correct and effective clinical decisions. Automated classification systems help avoid subjective outcomes and are more efficient. Deep learning and more specifically Convolutional Neural Networks have achieved state-of-the-art performance on various biomedical image classification problems. However, real-world data often suffers from the data imbalance problem, owing to which the trained classifier is biased towards the majority classes and does not perform well on the minority classes. This study presents an imbalanced blood cells classification method that utilises Wasserstein divergence GAN, mixup and novel nonlinear mixup for data augmentation to achieve oversampling of the minority classes. We also present a minority class focussed sampling strategy, which allows effective representation of minority class samples produced by all three data augmentation techniques and contributes to the classification performance. The method was evaluated on two publicly available datasets of immortalised human T-lymphocyte cells and Red Blood Cells. Classification performance evaluated using F1-score shows that our proposed approach outperforms existing methods on the same datasets. Nature Publishing Group UK 2022-10-27 /pmc/articles/PMC9613648/ /pubmed/36302948 http://dx.doi.org/10.1038/s41598-022-22882-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Rana, Priyanka
Sowmya, Arcot
Meijering, Erik
Song, Yang
Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification
title Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification
title_full Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification
title_fullStr Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification
title_full_unstemmed Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification
title_short Data augmentation with improved regularisation and sampling for imbalanced blood cell image classification
title_sort data augmentation with improved regularisation and sampling for imbalanced blood cell image classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9613648/
https://www.ncbi.nlm.nih.gov/pubmed/36302948
http://dx.doi.org/10.1038/s41598-022-22882-x
work_keys_str_mv AT ranapriyanka dataaugmentationwithimprovedregularisationandsamplingforimbalancedbloodcellimageclassification
AT sowmyaarcot dataaugmentationwithimprovedregularisationandsamplingforimbalancedbloodcellimageclassification
AT meijeringerik dataaugmentationwithimprovedregularisationandsamplingforimbalancedbloodcellimageclassification
AT songyang dataaugmentationwithimprovedregularisationandsamplingforimbalancedbloodcellimageclassification