Cargando…

Error curves for evaluating the quality of feature rankings

In this article, we propose a method for evaluating feature ranking algorithms. A feature ranking algorithm estimates the importance of descriptive features when predicting the target variable, and the proposed method evaluates the correctness of these importance values by computing the error measur...

Descripción completa

Detalles Bibliográficos
Autores principales: Slavkov, Ivica, Petković, Matej, Geurts, Pierre, Kocev, Dragi, Džeroski, Sašo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924685/
https://www.ncbi.nlm.nih.gov/pubmed/33816961
http://dx.doi.org/10.7717/peerj-cs.310
Descripción
Sumario:In this article, we propose a method for evaluating feature ranking algorithms. A feature ranking algorithm estimates the importance of descriptive features when predicting the target variable, and the proposed method evaluates the correctness of these importance values by computing the error measures of two chains of predictive models. The models in the first chain are built on nested sets of top-ranked features, while the models in the other chain are built on nested sets of bottom ranked features. We investigate which predictive models are appropriate for building these chains, showing empirically that the proposed method gives meaningful results and can detect differences in feature ranking quality. This is first demonstrated on synthetic data, and then on several real-world classification benchmark problems.