Cargando…

Recursive Random Lasso (RRLasso) for Identifying Anti-Cancer Drug Targets

Uncovering driver genes is crucial for understanding heterogeneity in cancer. L (1)-type regularization approaches have been widely used for uncovering cancer driver genes based on genome-scale data. Although the existing methods have been widely applied in the field of bioinformatics, they possess...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Heewon, Imoto, Seiya, Miyano, Satoru
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4636151/
https://www.ncbi.nlm.nih.gov/pubmed/26544691
http://dx.doi.org/10.1371/journal.pone.0141869
Descripción
Sumario:Uncovering driver genes is crucial for understanding heterogeneity in cancer. L (1)-type regularization approaches have been widely used for uncovering cancer driver genes based on genome-scale data. Although the existing methods have been widely applied in the field of bioinformatics, they possess several drawbacks: subset size limitations, erroneous estimation results, multicollinearity, and heavy time consumption. We introduce a novel statistical strategy, called a Recursive Random Lasso (RRLasso), for high dimensional genomic data analysis and investigation of driver genes. For time-effective analysis, we consider a recursive bootstrap procedure in line with the random lasso. Furthermore, we introduce a parametric statistical test for driver gene selection based on bootstrap regression modeling results. The proposed RRLasso is not only rapid but performs well for high dimensional genomic data analysis. Monte Carlo simulations and analysis of the “Sanger Genomics of Drug Sensitivity in Cancer dataset from the Cancer Genome Project” show that the proposed RRLasso is an effective tool for high dimensional genomic data analysis. The proposed methods provide reliable and biologically relevant results for cancer driver gene selection.