Cargando…

Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L(1/2 +2) Regularization

Cancer classification and feature (gene) selection plays an important role in knowledge discovery in genomic data. Although logistic regression is one of the most popular classification methods, it does not induce feature selection. In this paper, we presented a new hybrid L(1/2 +2) regularization (...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Hai-Hui, Liu, Xiao-Ying, Liang, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4852916/
https://www.ncbi.nlm.nih.gov/pubmed/27136190
http://dx.doi.org/10.1371/journal.pone.0149675
Descripción
Sumario:Cancer classification and feature (gene) selection plays an important role in knowledge discovery in genomic data. Although logistic regression is one of the most popular classification methods, it does not induce feature selection. In this paper, we presented a new hybrid L(1/2 +2) regularization (HLR) function, a linear combination of L(1/2) and L(2) penalties, to select the relevant gene in the logistic regression. The HLR approach inherits some fascinating characteristics from L(1/2) (sparsity) and L(2) (grouping effect where highly correlated variables are in or out a model together) penalties. We also proposed a novel univariate HLR thresholding approach to update the estimated coefficients and developed the coordinate descent algorithm for the HLR penalized logistic regression model. The empirical results and simulations indicate that the proposed method is highly competitive amongst several state-of-the-art methods.