Cargando…

Efficient Regularized Regression with L (0) Penalty for Variable Selection and Network Construction

Variable selections for regression with high-dimensional big data have found many applications in bioinformatics and computational biology. One appealing approach is the L (0) regularized regression which penalizes the number of nonzero features in the model directly. However, it is well known that...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Zhenqiu, Li, Gang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi Publishing Corporation 2016
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5098106/ https://www.ncbi.nlm.nih.gov/pubmed/27843486 http://dx.doi.org/10.1155/2016/3456153

_version_	1782465718121398272
author	Liu, Zhenqiu Li, Gang
author_facet	Liu, Zhenqiu Li, Gang
author_sort	Liu, Zhenqiu
collection	PubMed
description	Variable selections for regression with high-dimensional big data have found many applications in bioinformatics and computational biology. One appealing approach is the L (0) regularized regression which penalizes the number of nonzero features in the model directly. However, it is well known that L (0) optimization is NP-hard and computationally challenging. In this paper, we propose efficient EM (L (0)EM) and dual L (0)EM (DL (0)EM) algorithms that directly approximate the L (0) optimization problem. While L (0)EM is efficient with large sample size, DL (0)EM is efficient with high-dimensional (n ≪ m) data. They also provide a natural solution to all L (p) p ∈ [0,2] problems, including lasso with p = 1 and elastic net with p ∈ [1,2]. The regularized parameter λ can be determined through cross validation or AIC and BIC. We demonstrate our methods through simulation and high-dimensional genomic data. The results indicate that L (0) has better performance than lasso, SCAD, and MC+, and L (0) with AIC or BIC has similar performance as computationally intensive cross validation. The proposed algorithms are efficient in identifying the nonzero variables with less bias and constructing biologically important networks with high-dimensional big data.
format	Online Article Text
id	pubmed-5098106
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Hindawi Publishing Corporation
record_format	MEDLINE/PubMed
spelling	pubmed-50981062016-11-14 Efficient Regularized Regression with L (0) Penalty for Variable Selection and Network Construction Liu, Zhenqiu Li, Gang Comput Math Methods Med Research Article Variable selections for regression with high-dimensional big data have found many applications in bioinformatics and computational biology. One appealing approach is the L (0) regularized regression which penalizes the number of nonzero features in the model directly. However, it is well known that L (0) optimization is NP-hard and computationally challenging. In this paper, we propose efficient EM (L (0)EM) and dual L (0)EM (DL (0)EM) algorithms that directly approximate the L (0) optimization problem. While L (0)EM is efficient with large sample size, DL (0)EM is efficient with high-dimensional (n ≪ m) data. They also provide a natural solution to all L (p) p ∈ [0,2] problems, including lasso with p = 1 and elastic net with p ∈ [1,2]. The regularized parameter λ can be determined through cross validation or AIC and BIC. We demonstrate our methods through simulation and high-dimensional genomic data. The results indicate that L (0) has better performance than lasso, SCAD, and MC+, and L (0) with AIC or BIC has similar performance as computationally intensive cross validation. The proposed algorithms are efficient in identifying the nonzero variables with less bias and constructing biologically important networks with high-dimensional big data. Hindawi Publishing Corporation 2016 2016-10-24 /pmc/articles/PMC5098106/ /pubmed/27843486 http://dx.doi.org/10.1155/2016/3456153 Text en Copyright © 2016 Z. Liu and G. Li. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Liu, Zhenqiu Li, Gang Efficient Regularized Regression with L (0) Penalty for Variable Selection and Network Construction
title	Efficient Regularized Regression with L (0) Penalty for Variable Selection and Network Construction
title_full	Efficient Regularized Regression with L (0) Penalty for Variable Selection and Network Construction
title_fullStr	Efficient Regularized Regression with L (0) Penalty for Variable Selection and Network Construction
title_full_unstemmed	Efficient Regularized Regression with L (0) Penalty for Variable Selection and Network Construction
title_short	Efficient Regularized Regression with L (0) Penalty for Variable Selection and Network Construction
title_sort	efficient regularized regression with l (0) penalty for variable selection and network construction
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5098106/ https://www.ncbi.nlm.nih.gov/pubmed/27843486 http://dx.doi.org/10.1155/2016/3456153
work_keys_str_mv	AT liuzhenqiu efficientregularizedregressionwithl0penaltyforvariableselectionandnetworkconstruction AT ligang efficientregularizedregressionwithl0penaltyforvariableselectionandnetworkconstruction

Efficient Regularized Regression with L (0) Penalty for Variable Selection and Network Construction

Ejemplares similares