Cargando…

Relevance, redundancy, and complementarity trade-off (RRCT): A principled, generic, robust feature-selection tool

We present a new heuristic feature-selection (FS) algorithm that integrates in a principled algorithmic framework the three key FS components: relevance, redundancy, and complementarity. Thus, we call it relevance, redundancy, and complementarity trade-off (RRCT). The association strength between ea...

Descripción completa

Detalles Bibliográficos
Autor principal: Tsanas, Athanasios
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9122960/
https://www.ncbi.nlm.nih.gov/pubmed/35607618
http://dx.doi.org/10.1016/j.patter.2022.100471
_version_ 1784711458951528448
author Tsanas, Athanasios
author_facet Tsanas, Athanasios
author_sort Tsanas, Athanasios
collection PubMed
description We present a new heuristic feature-selection (FS) algorithm that integrates in a principled algorithmic framework the three key FS components: relevance, redundancy, and complementarity. Thus, we call it relevance, redundancy, and complementarity trade-off (RRCT). The association strength between each feature and the response and between feature pairs is quantified via an information theoretic transformation of rank correlation coefficients, and the feature complementarity is quantified using partial correlation coefficients. We empirically benchmark the performance of RRCT against 19 FS algorithms across four synthetic and eight real-world datasets in indicative challenging settings evaluating the following: (1) matching the true feature set and (2) out-of-sample performance in binary and multi-class classification problems when presenting selected features into a random forest. RRCT is very competitive in both tasks, and we tentatively make suggestions on the generalizability and application of the best-performing FS algorithms across settings where they may operate effectively.
format Online
Article
Text
id pubmed-9122960
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-91229602022-05-22 Relevance, redundancy, and complementarity trade-off (RRCT): A principled, generic, robust feature-selection tool Tsanas, Athanasios Patterns (N Y) Article We present a new heuristic feature-selection (FS) algorithm that integrates in a principled algorithmic framework the three key FS components: relevance, redundancy, and complementarity. Thus, we call it relevance, redundancy, and complementarity trade-off (RRCT). The association strength between each feature and the response and between feature pairs is quantified via an information theoretic transformation of rank correlation coefficients, and the feature complementarity is quantified using partial correlation coefficients. We empirically benchmark the performance of RRCT against 19 FS algorithms across four synthetic and eight real-world datasets in indicative challenging settings evaluating the following: (1) matching the true feature set and (2) out-of-sample performance in binary and multi-class classification problems when presenting selected features into a random forest. RRCT is very competitive in both tasks, and we tentatively make suggestions on the generalizability and application of the best-performing FS algorithms across settings where they may operate effectively. Elsevier 2022-03-31 /pmc/articles/PMC9122960/ /pubmed/35607618 http://dx.doi.org/10.1016/j.patter.2022.100471 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tsanas, Athanasios
Relevance, redundancy, and complementarity trade-off (RRCT): A principled, generic, robust feature-selection tool
title Relevance, redundancy, and complementarity trade-off (RRCT): A principled, generic, robust feature-selection tool
title_full Relevance, redundancy, and complementarity trade-off (RRCT): A principled, generic, robust feature-selection tool
title_fullStr Relevance, redundancy, and complementarity trade-off (RRCT): A principled, generic, robust feature-selection tool
title_full_unstemmed Relevance, redundancy, and complementarity trade-off (RRCT): A principled, generic, robust feature-selection tool
title_short Relevance, redundancy, and complementarity trade-off (RRCT): A principled, generic, robust feature-selection tool
title_sort relevance, redundancy, and complementarity trade-off (rrct): a principled, generic, robust feature-selection tool
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9122960/
https://www.ncbi.nlm.nih.gov/pubmed/35607618
http://dx.doi.org/10.1016/j.patter.2022.100471
work_keys_str_mv AT tsanasathanasios relevanceredundancyandcomplementaritytradeoffrrctaprincipledgenericrobustfeatureselectiontool