Cargando…
Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification
Binary classification and anomaly detection face the problem of class imbalance in data sets. The contribution of this paper is to provide an ensemble model that improves image binary classification by reducing the class imbalance between the minority and majority classes in a data set. The ensemble...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Japan
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9437415/ https://www.ncbi.nlm.nih.gov/pubmed/36068817 http://dx.doi.org/10.1007/s10015-022-00781-8 |
_version_ | 1784781604803051520 |
---|---|
author | Aloo, Rogers Mutoh, Atsuko Moriyama, Koichi Matsui, Tohgoroh Inuzuka, Nobuhiro |
author_facet | Aloo, Rogers Mutoh, Atsuko Moriyama, Koichi Matsui, Tohgoroh Inuzuka, Nobuhiro |
author_sort | Aloo, Rogers |
collection | PubMed |
description | Binary classification and anomaly detection face the problem of class imbalance in data sets. The contribution of this paper is to provide an ensemble model that improves image binary classification by reducing the class imbalance between the minority and majority classes in a data set. The ensemble model is a classifier of real images, synthetic images, and metadata associated with the real images. First, we apply a generative model to synthesize images of the minority class from the real image data set. Secondly, we train the ensemble model jointly with synthesized images of the minority class, real images, and metadata. Finally, we evaluate the model performance using a sensitivity metric to observe the difference in classification resulting from the adjustment of class imbalance. Improving the imbalance of the minority class by adding half the size of the majority class we observe an improvement in the classifier’s sensitivity by 12% and 24% for the benchmark pre-trained models of RESNET50 and DENSENet121 respectively. |
format | Online Article Text |
id | pubmed-9437415 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Japan |
record_format | MEDLINE/PubMed |
spelling | pubmed-94374152022-09-02 Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification Aloo, Rogers Mutoh, Atsuko Moriyama, Koichi Matsui, Tohgoroh Inuzuka, Nobuhiro Artif Life Robot Original Article Binary classification and anomaly detection face the problem of class imbalance in data sets. The contribution of this paper is to provide an ensemble model that improves image binary classification by reducing the class imbalance between the minority and majority classes in a data set. The ensemble model is a classifier of real images, synthetic images, and metadata associated with the real images. First, we apply a generative model to synthesize images of the minority class from the real image data set. Secondly, we train the ensemble model jointly with synthesized images of the minority class, real images, and metadata. Finally, we evaluate the model performance using a sensitivity metric to observe the difference in classification resulting from the adjustment of class imbalance. Improving the imbalance of the minority class by adding half the size of the majority class we observe an improvement in the classifier’s sensitivity by 12% and 24% for the benchmark pre-trained models of RESNET50 and DENSENet121 respectively. Springer Japan 2022-09-02 2022 /pmc/articles/PMC9437415/ /pubmed/36068817 http://dx.doi.org/10.1007/s10015-022-00781-8 Text en © International Society of Artificial Life and Robotics (ISAROB) 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Article Aloo, Rogers Mutoh, Atsuko Moriyama, Koichi Matsui, Tohgoroh Inuzuka, Nobuhiro Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification |
title | Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification |
title_full | Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification |
title_fullStr | Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification |
title_full_unstemmed | Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification |
title_short | Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification |
title_sort | ensemble method using real images, metadata and synthetic images for control of class imbalance in classification |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9437415/ https://www.ncbi.nlm.nih.gov/pubmed/36068817 http://dx.doi.org/10.1007/s10015-022-00781-8 |
work_keys_str_mv | AT aloorogers ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification AT mutohatsuko ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification AT moriyamakoichi ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification AT matsuitohgoroh ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification AT inuzukanobuhiro ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification |