Cargando…

Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L(1/2 +2) Regularization

Cancer classification and feature (gene) selection plays an important role in knowledge discovery in genomic data. Although logistic regression is one of the most popular classification methods, it does not induce feature selection. In this paper, we presented a new hybrid L(1/2 +2) regularization (...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Hai-Hui, Liu, Xiao-Ying, Liang, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4852916/
https://www.ncbi.nlm.nih.gov/pubmed/27136190
http://dx.doi.org/10.1371/journal.pone.0149675
_version_ 1782430007844405248
author Huang, Hai-Hui
Liu, Xiao-Ying
Liang, Yong
author_facet Huang, Hai-Hui
Liu, Xiao-Ying
Liang, Yong
author_sort Huang, Hai-Hui
collection PubMed
description Cancer classification and feature (gene) selection plays an important role in knowledge discovery in genomic data. Although logistic regression is one of the most popular classification methods, it does not induce feature selection. In this paper, we presented a new hybrid L(1/2 +2) regularization (HLR) function, a linear combination of L(1/2) and L(2) penalties, to select the relevant gene in the logistic regression. The HLR approach inherits some fascinating characteristics from L(1/2) (sparsity) and L(2) (grouping effect where highly correlated variables are in or out a model together) penalties. We also proposed a novel univariate HLR thresholding approach to update the estimated coefficients and developed the coordinate descent algorithm for the HLR penalized logistic regression model. The empirical results and simulations indicate that the proposed method is highly competitive amongst several state-of-the-art methods.
format Online
Article
Text
id pubmed-4852916
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-48529162016-05-13 Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L(1/2 +2) Regularization Huang, Hai-Hui Liu, Xiao-Ying Liang, Yong PLoS One Research Article Cancer classification and feature (gene) selection plays an important role in knowledge discovery in genomic data. Although logistic regression is one of the most popular classification methods, it does not induce feature selection. In this paper, we presented a new hybrid L(1/2 +2) regularization (HLR) function, a linear combination of L(1/2) and L(2) penalties, to select the relevant gene in the logistic regression. The HLR approach inherits some fascinating characteristics from L(1/2) (sparsity) and L(2) (grouping effect where highly correlated variables are in or out a model together) penalties. We also proposed a novel univariate HLR thresholding approach to update the estimated coefficients and developed the coordinate descent algorithm for the HLR penalized logistic regression model. The empirical results and simulations indicate that the proposed method is highly competitive amongst several state-of-the-art methods. Public Library of Science 2016-05-02 /pmc/articles/PMC4852916/ /pubmed/27136190 http://dx.doi.org/10.1371/journal.pone.0149675 Text en © 2016 Huang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Huang, Hai-Hui
Liu, Xiao-Ying
Liang, Yong
Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L(1/2 +2) Regularization
title Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L(1/2 +2) Regularization
title_full Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L(1/2 +2) Regularization
title_fullStr Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L(1/2 +2) Regularization
title_full_unstemmed Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L(1/2 +2) Regularization
title_short Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L(1/2 +2) Regularization
title_sort feature selection and cancer classification via sparse logistic regression with the hybrid l(1/2 +2) regularization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4852916/
https://www.ncbi.nlm.nih.gov/pubmed/27136190
http://dx.doi.org/10.1371/journal.pone.0149675
work_keys_str_mv AT huanghaihui featureselectionandcancerclassificationviasparselogisticregressionwiththehybridl122regularization
AT liuxiaoying featureselectionandcancerclassificationviasparselogisticregressionwiththehybridl122regularization
AT liangyong featureselectionandcancerclassificationviasparselogisticregressionwiththehybridl122regularization