Cargando…

Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification

Binary classification and anomaly detection face the problem of class imbalance in data sets. The contribution of this paper is to provide an ensemble model that improves image binary classification by reducing the class imbalance between the minority and majority classes in a data set. The ensemble...

Descripción completa

Detalles Bibliográficos
Autores principales: Aloo, Rogers, Mutoh, Atsuko, Moriyama, Koichi, Matsui, Tohgoroh, Inuzuka, Nobuhiro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Japan 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9437415/
https://www.ncbi.nlm.nih.gov/pubmed/36068817
http://dx.doi.org/10.1007/s10015-022-00781-8
_version_ 1784781604803051520
author Aloo, Rogers
Mutoh, Atsuko
Moriyama, Koichi
Matsui, Tohgoroh
Inuzuka, Nobuhiro
author_facet Aloo, Rogers
Mutoh, Atsuko
Moriyama, Koichi
Matsui, Tohgoroh
Inuzuka, Nobuhiro
author_sort Aloo, Rogers
collection PubMed
description Binary classification and anomaly detection face the problem of class imbalance in data sets. The contribution of this paper is to provide an ensemble model that improves image binary classification by reducing the class imbalance between the minority and majority classes in a data set. The ensemble model is a classifier of real images, synthetic images, and metadata associated with the real images. First, we apply a generative model to synthesize images of the minority class from the real image data set. Secondly, we train the ensemble model jointly with synthesized images of the minority class, real images, and metadata. Finally, we evaluate the model performance using a sensitivity metric to observe the difference in classification resulting from the adjustment of class imbalance. Improving the imbalance of the minority class by adding half the size of the majority class we observe an improvement in the classifier’s sensitivity by 12% and 24% for the benchmark pre-trained models of RESNET50 and DENSENet121 respectively.
format Online
Article
Text
id pubmed-9437415
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Japan
record_format MEDLINE/PubMed
spelling pubmed-94374152022-09-02 Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification Aloo, Rogers Mutoh, Atsuko Moriyama, Koichi Matsui, Tohgoroh Inuzuka, Nobuhiro Artif Life Robot Original Article Binary classification and anomaly detection face the problem of class imbalance in data sets. The contribution of this paper is to provide an ensemble model that improves image binary classification by reducing the class imbalance between the minority and majority classes in a data set. The ensemble model is a classifier of real images, synthetic images, and metadata associated with the real images. First, we apply a generative model to synthesize images of the minority class from the real image data set. Secondly, we train the ensemble model jointly with synthesized images of the minority class, real images, and metadata. Finally, we evaluate the model performance using a sensitivity metric to observe the difference in classification resulting from the adjustment of class imbalance. Improving the imbalance of the minority class by adding half the size of the majority class we observe an improvement in the classifier’s sensitivity by 12% and 24% for the benchmark pre-trained models of RESNET50 and DENSENet121 respectively. Springer Japan 2022-09-02 2022 /pmc/articles/PMC9437415/ /pubmed/36068817 http://dx.doi.org/10.1007/s10015-022-00781-8 Text en © International Society of Artificial Life and Robotics (ISAROB) 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Aloo, Rogers
Mutoh, Atsuko
Moriyama, Koichi
Matsui, Tohgoroh
Inuzuka, Nobuhiro
Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification
title Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification
title_full Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification
title_fullStr Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification
title_full_unstemmed Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification
title_short Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification
title_sort ensemble method using real images, metadata and synthetic images for control of class imbalance in classification
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9437415/
https://www.ncbi.nlm.nih.gov/pubmed/36068817
http://dx.doi.org/10.1007/s10015-022-00781-8
work_keys_str_mv AT aloorogers ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification
AT mutohatsuko ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification
AT moriyamakoichi ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification
AT matsuitohgoroh ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification
AT inuzukanobuhiro ensemblemethodusingrealimagesmetadataandsyntheticimagesforcontrolofclassimbalanceinclassification