Cargando…

Regularized F-Measure Maximization for Feature Selection and Classification

Receiver Operating Characteristic (ROC) analysis is a common tool for assessing the performance of various classifications. It gained much popularity in medical and other fields including biological markers and, diagnostic test. This is particularly due to the fact that in real-world problems miscla...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zhenqiu, Tan, Ming, Jiang, Feng
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674633/
https://www.ncbi.nlm.nih.gov/pubmed/19421401
http://dx.doi.org/10.1155/2009/617946
_version_ 1782166656620953600
author Liu, Zhenqiu
Tan, Ming
Jiang, Feng
author_facet Liu, Zhenqiu
Tan, Ming
Jiang, Feng
author_sort Liu, Zhenqiu
collection PubMed
description Receiver Operating Characteristic (ROC) analysis is a common tool for assessing the performance of various classifications. It gained much popularity in medical and other fields including biological markers and, diagnostic test. This is particularly due to the fact that in real-world problems misclassification costs are not known, and thus, ROC curve and related utility functions such as F-measure can be more meaningful performance measures. F-measure combines recall and precision into a global measure. In this paper, we propose a novel method through regularized F-measure maximization. The proposed method assigns different costs to positive and negative samples and does simultaneous feature selection and prediction with L(1) penalty. This method is useful especially when data set is highly unbalanced, or the labels for negative (positive) samples are missing. Our experiments with the benchmark, methylation, and high dimensional microarray data show that the performance of proposed algorithm is better or equivalent compared with the other popular classifiers in limited experiments.
format Text
id pubmed-2674633
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-26746332009-05-06 Regularized F-Measure Maximization for Feature Selection and Classification Liu, Zhenqiu Tan, Ming Jiang, Feng J Biomed Biotechnol Methodology Report Receiver Operating Characteristic (ROC) analysis is a common tool for assessing the performance of various classifications. It gained much popularity in medical and other fields including biological markers and, diagnostic test. This is particularly due to the fact that in real-world problems misclassification costs are not known, and thus, ROC curve and related utility functions such as F-measure can be more meaningful performance measures. F-measure combines recall and precision into a global measure. In this paper, we propose a novel method through regularized F-measure maximization. The proposed method assigns different costs to positive and negative samples and does simultaneous feature selection and prediction with L(1) penalty. This method is useful especially when data set is highly unbalanced, or the labels for negative (positive) samples are missing. Our experiments with the benchmark, methylation, and high dimensional microarray data show that the performance of proposed algorithm is better or equivalent compared with the other popular classifiers in limited experiments. Hindawi Publishing Corporation 2009 2009-04-27 /pmc/articles/PMC2674633/ /pubmed/19421401 http://dx.doi.org/10.1155/2009/617946 Text en Copyright © 2009 Zhenqiu Liu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Report
Liu, Zhenqiu
Tan, Ming
Jiang, Feng
Regularized F-Measure Maximization for Feature Selection and Classification
title Regularized F-Measure Maximization for Feature Selection and Classification
title_full Regularized F-Measure Maximization for Feature Selection and Classification
title_fullStr Regularized F-Measure Maximization for Feature Selection and Classification
title_full_unstemmed Regularized F-Measure Maximization for Feature Selection and Classification
title_short Regularized F-Measure Maximization for Feature Selection and Classification
title_sort regularized f-measure maximization for feature selection and classification
topic Methodology Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674633/
https://www.ncbi.nlm.nih.gov/pubmed/19421401
http://dx.doi.org/10.1155/2009/617946
work_keys_str_mv AT liuzhenqiu regularizedfmeasuremaximizationforfeatureselectionandclassification
AT tanming regularizedfmeasuremaximizationforfeatureselectionandclassification
AT jiangfeng regularizedfmeasuremaximizationforfeatureselectionandclassification