Cargando…
Regularized F-Measure Maximization for Feature Selection and Classification
Receiver Operating Characteristic (ROC) analysis is a common tool for assessing the performance of various classifications. It gained much popularity in medical and other fields including biological markers and, diagnostic test. This is particularly due to the fact that in real-world problems miscla...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674633/ https://www.ncbi.nlm.nih.gov/pubmed/19421401 http://dx.doi.org/10.1155/2009/617946 |
_version_ | 1782166656620953600 |
---|---|
author | Liu, Zhenqiu Tan, Ming Jiang, Feng |
author_facet | Liu, Zhenqiu Tan, Ming Jiang, Feng |
author_sort | Liu, Zhenqiu |
collection | PubMed |
description | Receiver Operating Characteristic (ROC) analysis is a common tool for assessing the performance of various classifications. It gained much popularity in medical and other fields including biological markers and, diagnostic test. This is particularly due to the fact that in real-world problems misclassification costs are not known, and thus, ROC curve and related utility functions such as F-measure can be more meaningful performance measures. F-measure combines recall and precision into a global measure. In this paper, we propose a novel method through regularized F-measure maximization. The proposed method assigns different costs to positive and negative samples and does simultaneous feature selection and prediction with L(1) penalty. This method is useful especially when data set is highly unbalanced, or the labels for negative (positive) samples are missing. Our experiments with the benchmark, methylation, and high dimensional microarray data show that the performance of proposed algorithm is better or equivalent compared with the other popular classifiers in limited experiments. |
format | Text |
id | pubmed-2674633 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-26746332009-05-06 Regularized F-Measure Maximization for Feature Selection and Classification Liu, Zhenqiu Tan, Ming Jiang, Feng J Biomed Biotechnol Methodology Report Receiver Operating Characteristic (ROC) analysis is a common tool for assessing the performance of various classifications. It gained much popularity in medical and other fields including biological markers and, diagnostic test. This is particularly due to the fact that in real-world problems misclassification costs are not known, and thus, ROC curve and related utility functions such as F-measure can be more meaningful performance measures. F-measure combines recall and precision into a global measure. In this paper, we propose a novel method through regularized F-measure maximization. The proposed method assigns different costs to positive and negative samples and does simultaneous feature selection and prediction with L(1) penalty. This method is useful especially when data set is highly unbalanced, or the labels for negative (positive) samples are missing. Our experiments with the benchmark, methylation, and high dimensional microarray data show that the performance of proposed algorithm is better or equivalent compared with the other popular classifiers in limited experiments. Hindawi Publishing Corporation 2009 2009-04-27 /pmc/articles/PMC2674633/ /pubmed/19421401 http://dx.doi.org/10.1155/2009/617946 Text en Copyright © 2009 Zhenqiu Liu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Report Liu, Zhenqiu Tan, Ming Jiang, Feng Regularized F-Measure Maximization for Feature Selection and Classification |
title | Regularized F-Measure Maximization for Feature Selection and Classification |
title_full | Regularized F-Measure Maximization for Feature Selection and Classification |
title_fullStr | Regularized F-Measure Maximization for Feature Selection and Classification |
title_full_unstemmed | Regularized F-Measure Maximization for Feature Selection and Classification |
title_short | Regularized F-Measure Maximization for Feature Selection and Classification |
title_sort | regularized f-measure maximization for feature selection and classification |
topic | Methodology Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674633/ https://www.ncbi.nlm.nih.gov/pubmed/19421401 http://dx.doi.org/10.1155/2009/617946 |
work_keys_str_mv | AT liuzhenqiu regularizedfmeasuremaximizationforfeatureselectionandclassification AT tanming regularizedfmeasuremaximizationforfeatureselectionandclassification AT jiangfeng regularizedfmeasuremaximizationforfeatureselectionandclassification |