Cargando…

Evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells

Differential bone marrow (BM) cell counting is an important test for the diagnosis of various hematological diseases. However, it is difficult to accurately classify BM cells due to non-uniformity and the lack of reproducibility of differential counting. Therefore, automatic classification systems h...

Descripción completa

Detalles Bibliográficos
Autores principales: Nakamura, Iori, Ida, Haruhi, Yabuta, Mayu, Kashiwa, Wataru, Tsukamoto, Maho, Sato, Shigeki, Ota, Syuichi, Kobayashi, Naoki, Masauzi, Hiromi, Okada, Kazunori, Kaga, Sanae, Miwa, Keiko, Kanai, Hiroshi, Masauzi, Nobuo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9537320/
https://www.ncbi.nlm.nih.gov/pubmed/36202847
http://dx.doi.org/10.1038/s41598-022-20651-4
_version_ 1784803175186825216
author Nakamura, Iori
Ida, Haruhi
Yabuta, Mayu
Kashiwa, Wataru
Tsukamoto, Maho
Sato, Shigeki
Ota, Syuichi
Kobayashi, Naoki
Masauzi, Hiromi
Okada, Kazunori
Kaga, Sanae
Miwa, Keiko
Kanai, Hiroshi
Masauzi, Nobuo
author_facet Nakamura, Iori
Ida, Haruhi
Yabuta, Mayu
Kashiwa, Wataru
Tsukamoto, Maho
Sato, Shigeki
Ota, Syuichi
Kobayashi, Naoki
Masauzi, Hiromi
Okada, Kazunori
Kaga, Sanae
Miwa, Keiko
Kanai, Hiroshi
Masauzi, Nobuo
author_sort Nakamura, Iori
collection PubMed
description Differential bone marrow (BM) cell counting is an important test for the diagnosis of various hematological diseases. However, it is difficult to accurately classify BM cells due to non-uniformity and the lack of reproducibility of differential counting. Therefore, automatic classification systems have been developed in which deep learning is used. These systems requires large and accurately labeled datasets for training. To overcome this, we used semi-supervised learning (SSL), in which learning proceeds while labeling. We used three methods: self-training (ST), active learning (AL), and a combination of these methods, and attempted to automatically classify 16 types of BM cell images. ST involves data verification, as in AL, before adding them to the training dataset (confirmed self-training: CST). After 25 rounds of CST, AL, and CST + AL, the initial number of training data increased from 425 to 40,518; 3682; and 47,843, respectively. Accuracies for the test data of 50 images for each cell type were 0.944, 0.941, and 0.976, respectively. Data added with CST or AL showed some imbalances between classes, while CST + AL exhibited fewer imbalances. We suggest that CST + AL, when combined with two SSL methods, is efficient in increasing training data for the development of automatic BM cells classification systems.
format Online
Article
Text
id pubmed-9537320
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-95373202022-10-08 Evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells Nakamura, Iori Ida, Haruhi Yabuta, Mayu Kashiwa, Wataru Tsukamoto, Maho Sato, Shigeki Ota, Syuichi Kobayashi, Naoki Masauzi, Hiromi Okada, Kazunori Kaga, Sanae Miwa, Keiko Kanai, Hiroshi Masauzi, Nobuo Sci Rep Article Differential bone marrow (BM) cell counting is an important test for the diagnosis of various hematological diseases. However, it is difficult to accurately classify BM cells due to non-uniformity and the lack of reproducibility of differential counting. Therefore, automatic classification systems have been developed in which deep learning is used. These systems requires large and accurately labeled datasets for training. To overcome this, we used semi-supervised learning (SSL), in which learning proceeds while labeling. We used three methods: self-training (ST), active learning (AL), and a combination of these methods, and attempted to automatically classify 16 types of BM cell images. ST involves data verification, as in AL, before adding them to the training dataset (confirmed self-training: CST). After 25 rounds of CST, AL, and CST + AL, the initial number of training data increased from 425 to 40,518; 3682; and 47,843, respectively. Accuracies for the test data of 50 images for each cell type were 0.944, 0.941, and 0.976, respectively. Data added with CST or AL showed some imbalances between classes, while CST + AL exhibited fewer imbalances. We suggest that CST + AL, when combined with two SSL methods, is efficient in increasing training data for the development of automatic BM cells classification systems. Nature Publishing Group UK 2022-10-06 /pmc/articles/PMC9537320/ /pubmed/36202847 http://dx.doi.org/10.1038/s41598-022-20651-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Nakamura, Iori
Ida, Haruhi
Yabuta, Mayu
Kashiwa, Wataru
Tsukamoto, Maho
Sato, Shigeki
Ota, Syuichi
Kobayashi, Naoki
Masauzi, Hiromi
Okada, Kazunori
Kaga, Sanae
Miwa, Keiko
Kanai, Hiroshi
Masauzi, Nobuo
Evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells
title Evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells
title_full Evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells
title_fullStr Evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells
title_full_unstemmed Evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells
title_short Evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells
title_sort evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9537320/
https://www.ncbi.nlm.nih.gov/pubmed/36202847
http://dx.doi.org/10.1038/s41598-022-20651-4
work_keys_str_mv AT nakamuraiori evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT idaharuhi evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT yabutamayu evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT kashiwawataru evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT tsukamotomaho evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT satoshigeki evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT otasyuichi evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT kobayashinaoki evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT masauzihiromi evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT okadakazunori evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT kagasanae evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT miwakeiko evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT kanaihiroshi evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells
AT masauzinobuo evaluationoftwosemisupervisedlearningmethodsandtheircombinationforautomaticclassificationofbonemarrowcells