Cargando…

Knowledge database assisted gene marker selection for chronic lymphocytic leukemia

OBJECTIVE: To investigate whether previously curated chronic lymphocytic leukemia (CLL) risk genes could be leveraged in gene marker selection for the diagnosis and prediction of CLL. METHODS: A CLL genetic database (CLL_042017) was developed through a comprehensive CLL-gene relation data analysis,...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiang, Xixi, Wang, Yu-Ping, Cao, Hongbao, Zhang, Xi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6134680/
https://www.ncbi.nlm.nih.gov/pubmed/29996709
http://dx.doi.org/10.1177/0300060518783072
Descripción
Sumario:OBJECTIVE: To investigate whether previously curated chronic lymphocytic leukemia (CLL) risk genes could be leveraged in gene marker selection for the diagnosis and prediction of CLL. METHODS: A CLL genetic database (CLL_042017) was developed through a comprehensive CLL-gene relation data analysis, in which 753 CLL target genes were curated. Expression values for these genes were used for case-control classification of four CLL datasets, with a sparse representation-based variable selection (SRVS) approach employed for feature (gene) selection. Results were compared with outcomes obtained by using analysis of variance (ANOVA)-based gene selection approaches. RESULTS: For each of the four datasets, SRVS selected a subset of genes from the 753 CLL target genes, resulting in significantly higher classification accuracy, compared with randomly selected genes (100%, 100%, 93.94%, 89.39%). The SRVS method outperformed ANOVA in terms of classification accuracy. CONCLUSION: Gene markers selected from the 753 CLL genes could enable significantly greater accuracy in the prediction of CLL. SRVS provides an effective method for gene marker selection.