Cargando…

Multi-Label Feature Selection Combining Three Types of Conditional Relevance

With the rapid growth of the Internet, the curse of dimensionality caused by massive multi-label data has attracted extensive attention. Feature selection plays an indispensable role in dimensionality reduction processing. Many researchers have focused on this subject based on information theory. He...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Lingbo, Wang, Yiqiang, Li, Yonghao, Zhang, Ping, Hu, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700541/
https://www.ncbi.nlm.nih.gov/pubmed/34945923
http://dx.doi.org/10.3390/e23121617
_version_ 1784620781795278848
author Gao, Lingbo
Wang, Yiqiang
Li, Yonghao
Zhang, Ping
Hu, Liang
author_facet Gao, Lingbo
Wang, Yiqiang
Li, Yonghao
Zhang, Ping
Hu, Liang
author_sort Gao, Lingbo
collection PubMed
description With the rapid growth of the Internet, the curse of dimensionality caused by massive multi-label data has attracted extensive attention. Feature selection plays an indispensable role in dimensionality reduction processing. Many researchers have focused on this subject based on information theory. Here, to evaluate feature relevance, a novel feature relevance term (FR) that employs three incremental information terms to comprehensively consider three key aspects (candidate features, selected features, and label correlations) is designed. A thorough examination of the three key aspects of FR outlined above is more favorable to capturing the optimal features. Moreover, we employ label-related feature redundancy as the label-related feature redundancy term (LR) to reduce unnecessary redundancy. Therefore, a designed multi-label feature selection method that integrates FR with LR is proposed, namely, Feature Selection combining three types of Conditional Relevance (TCRFS). Numerous experiments indicate that TCRFS outperforms the other 6 state-of-the-art multi-label approaches on 13 multi-label benchmark data sets from 4 domains.
format Online
Article
Text
id pubmed-8700541
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87005412021-12-24 Multi-Label Feature Selection Combining Three Types of Conditional Relevance Gao, Lingbo Wang, Yiqiang Li, Yonghao Zhang, Ping Hu, Liang Entropy (Basel) Article With the rapid growth of the Internet, the curse of dimensionality caused by massive multi-label data has attracted extensive attention. Feature selection plays an indispensable role in dimensionality reduction processing. Many researchers have focused on this subject based on information theory. Here, to evaluate feature relevance, a novel feature relevance term (FR) that employs three incremental information terms to comprehensively consider three key aspects (candidate features, selected features, and label correlations) is designed. A thorough examination of the three key aspects of FR outlined above is more favorable to capturing the optimal features. Moreover, we employ label-related feature redundancy as the label-related feature redundancy term (LR) to reduce unnecessary redundancy. Therefore, a designed multi-label feature selection method that integrates FR with LR is proposed, namely, Feature Selection combining three types of Conditional Relevance (TCRFS). Numerous experiments indicate that TCRFS outperforms the other 6 state-of-the-art multi-label approaches on 13 multi-label benchmark data sets from 4 domains. MDPI 2021-12-01 /pmc/articles/PMC8700541/ /pubmed/34945923 http://dx.doi.org/10.3390/e23121617 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gao, Lingbo
Wang, Yiqiang
Li, Yonghao
Zhang, Ping
Hu, Liang
Multi-Label Feature Selection Combining Three Types of Conditional Relevance
title Multi-Label Feature Selection Combining Three Types of Conditional Relevance
title_full Multi-Label Feature Selection Combining Three Types of Conditional Relevance
title_fullStr Multi-Label Feature Selection Combining Three Types of Conditional Relevance
title_full_unstemmed Multi-Label Feature Selection Combining Three Types of Conditional Relevance
title_short Multi-Label Feature Selection Combining Three Types of Conditional Relevance
title_sort multi-label feature selection combining three types of conditional relevance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700541/
https://www.ncbi.nlm.nih.gov/pubmed/34945923
http://dx.doi.org/10.3390/e23121617
work_keys_str_mv AT gaolingbo multilabelfeatureselectioncombiningthreetypesofconditionalrelevance
AT wangyiqiang multilabelfeatureselectioncombiningthreetypesofconditionalrelevance
AT liyonghao multilabelfeatureselectioncombiningthreetypesofconditionalrelevance
AT zhangping multilabelfeatureselectioncombiningthreetypesofconditionalrelevance
AT huliang multilabelfeatureselectioncombiningthreetypesofconditionalrelevance