Cargando…

Partial Classifier Chains with Feature Selection by Exploiting Label Correlation in Multi-Label Classification

Multi-label classification (MLC) is a supervised learning problem where an object is naturally associated with multiple concepts because it can be described from various dimensions. How to exploit the resulting label correlations is the key issue in MLC problems. The classifier chain (CC) is a well-...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zhenwu, Wang, Tielin, Wan, Benting, Han, Mengjie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597295/
https://www.ncbi.nlm.nih.gov/pubmed/33286912
http://dx.doi.org/10.3390/e22101143
_version_ 1783602314937892864
author Wang, Zhenwu
Wang, Tielin
Wan, Benting
Han, Mengjie
author_facet Wang, Zhenwu
Wang, Tielin
Wan, Benting
Han, Mengjie
author_sort Wang, Zhenwu
collection PubMed
description Multi-label classification (MLC) is a supervised learning problem where an object is naturally associated with multiple concepts because it can be described from various dimensions. How to exploit the resulting label correlations is the key issue in MLC problems. The classifier chain (CC) is a well-known MLC approach that can learn complex coupling relationships between labels. CC suffers from two obvious drawbacks: (1) label ordering is decided at random although it usually has a strong effect on predictive performance; (2) all the labels are inserted into the chain, although some of them may carry irrelevant information that discriminates against the others. In this work, we propose a partial classifier chain method with feature selection (PCC-FS) that exploits the label correlation between label and feature spaces and thus solves the two previously mentioned problems simultaneously. In the PCC-FS algorithm, feature selection is performed by learning the covariance between feature set and label set, thus eliminating the irrelevant features that can diminish classification performance. Couplings in the label set are extracted, and the coupled labels of each label are inserted simultaneously into the chain structure to execute the training and prediction activities. The experimental results from five metrics demonstrate that, in comparison to eight state-of-the-art MLC algorithms, the proposed method is a significant improvement on existing multi-label classification.
format Online
Article
Text
id pubmed-7597295
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75972952020-11-09 Partial Classifier Chains with Feature Selection by Exploiting Label Correlation in Multi-Label Classification Wang, Zhenwu Wang, Tielin Wan, Benting Han, Mengjie Entropy (Basel) Article Multi-label classification (MLC) is a supervised learning problem where an object is naturally associated with multiple concepts because it can be described from various dimensions. How to exploit the resulting label correlations is the key issue in MLC problems. The classifier chain (CC) is a well-known MLC approach that can learn complex coupling relationships between labels. CC suffers from two obvious drawbacks: (1) label ordering is decided at random although it usually has a strong effect on predictive performance; (2) all the labels are inserted into the chain, although some of them may carry irrelevant information that discriminates against the others. In this work, we propose a partial classifier chain method with feature selection (PCC-FS) that exploits the label correlation between label and feature spaces and thus solves the two previously mentioned problems simultaneously. In the PCC-FS algorithm, feature selection is performed by learning the covariance between feature set and label set, thus eliminating the irrelevant features that can diminish classification performance. Couplings in the label set are extracted, and the coupled labels of each label are inserted simultaneously into the chain structure to execute the training and prediction activities. The experimental results from five metrics demonstrate that, in comparison to eight state-of-the-art MLC algorithms, the proposed method is a significant improvement on existing multi-label classification. MDPI 2020-10-10 /pmc/articles/PMC7597295/ /pubmed/33286912 http://dx.doi.org/10.3390/e22101143 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Zhenwu
Wang, Tielin
Wan, Benting
Han, Mengjie
Partial Classifier Chains with Feature Selection by Exploiting Label Correlation in Multi-Label Classification
title Partial Classifier Chains with Feature Selection by Exploiting Label Correlation in Multi-Label Classification
title_full Partial Classifier Chains with Feature Selection by Exploiting Label Correlation in Multi-Label Classification
title_fullStr Partial Classifier Chains with Feature Selection by Exploiting Label Correlation in Multi-Label Classification
title_full_unstemmed Partial Classifier Chains with Feature Selection by Exploiting Label Correlation in Multi-Label Classification
title_short Partial Classifier Chains with Feature Selection by Exploiting Label Correlation in Multi-Label Classification
title_sort partial classifier chains with feature selection by exploiting label correlation in multi-label classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597295/
https://www.ncbi.nlm.nih.gov/pubmed/33286912
http://dx.doi.org/10.3390/e22101143
work_keys_str_mv AT wangzhenwu partialclassifierchainswithfeatureselectionbyexploitinglabelcorrelationinmultilabelclassification
AT wangtielin partialclassifierchainswithfeatureselectionbyexploitinglabelcorrelationinmultilabelclassification
AT wanbenting partialclassifierchainswithfeatureselectionbyexploitinglabelcorrelationinmultilabelclassification
AT hanmengjie partialclassifierchainswithfeatureselectionbyexploitinglabelcorrelationinmultilabelclassification