Cargando…

Knowledge database assisted gene marker selection for chronic lymphocytic leukemia

OBJECTIVE: To investigate whether previously curated chronic lymphocytic leukemia (CLL) risk genes could be leveraged in gene marker selection for the diagnosis and prediction of CLL. METHODS: A CLL genetic database (CLL_042017) was developed through a comprehensive CLL-gene relation data analysis,...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiang, Xixi, Wang, Yu-Ping, Cao, Hongbao, Zhang, Xi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6134680/
https://www.ncbi.nlm.nih.gov/pubmed/29996709
http://dx.doi.org/10.1177/0300060518783072
_version_ 1783354703660187648
author Xiang, Xixi
Wang, Yu-Ping
Cao, Hongbao
Zhang, Xi
author_facet Xiang, Xixi
Wang, Yu-Ping
Cao, Hongbao
Zhang, Xi
author_sort Xiang, Xixi
collection PubMed
description OBJECTIVE: To investigate whether previously curated chronic lymphocytic leukemia (CLL) risk genes could be leveraged in gene marker selection for the diagnosis and prediction of CLL. METHODS: A CLL genetic database (CLL_042017) was developed through a comprehensive CLL-gene relation data analysis, in which 753 CLL target genes were curated. Expression values for these genes were used for case-control classification of four CLL datasets, with a sparse representation-based variable selection (SRVS) approach employed for feature (gene) selection. Results were compared with outcomes obtained by using analysis of variance (ANOVA)-based gene selection approaches. RESULTS: For each of the four datasets, SRVS selected a subset of genes from the 753 CLL target genes, resulting in significantly higher classification accuracy, compared with randomly selected genes (100%, 100%, 93.94%, 89.39%). The SRVS method outperformed ANOVA in terms of classification accuracy. CONCLUSION: Gene markers selected from the 753 CLL genes could enable significantly greater accuracy in the prediction of CLL. SRVS provides an effective method for gene marker selection.
format Online
Article
Text
id pubmed-6134680
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-61346802018-09-13 Knowledge database assisted gene marker selection for chronic lymphocytic leukemia Xiang, Xixi Wang, Yu-Ping Cao, Hongbao Zhang, Xi J Int Med Res Pre-Clinical Research Reports OBJECTIVE: To investigate whether previously curated chronic lymphocytic leukemia (CLL) risk genes could be leveraged in gene marker selection for the diagnosis and prediction of CLL. METHODS: A CLL genetic database (CLL_042017) was developed through a comprehensive CLL-gene relation data analysis, in which 753 CLL target genes were curated. Expression values for these genes were used for case-control classification of four CLL datasets, with a sparse representation-based variable selection (SRVS) approach employed for feature (gene) selection. Results were compared with outcomes obtained by using analysis of variance (ANOVA)-based gene selection approaches. RESULTS: For each of the four datasets, SRVS selected a subset of genes from the 753 CLL target genes, resulting in significantly higher classification accuracy, compared with randomly selected genes (100%, 100%, 93.94%, 89.39%). The SRVS method outperformed ANOVA in terms of classification accuracy. CONCLUSION: Gene markers selected from the 753 CLL genes could enable significantly greater accuracy in the prediction of CLL. SRVS provides an effective method for gene marker selection. SAGE Publications 2018-07-12 2018-08 /pmc/articles/PMC6134680/ /pubmed/29996709 http://dx.doi.org/10.1177/0300060518783072 Text en © The Author(s) 2018 http://creativecommons.org/licenses/by-nc/4.0/ Creative Commons Non Commercial CC BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Pre-Clinical Research Reports
Xiang, Xixi
Wang, Yu-Ping
Cao, Hongbao
Zhang, Xi
Knowledge database assisted gene marker selection for chronic lymphocytic leukemia
title Knowledge database assisted gene marker selection for chronic lymphocytic leukemia
title_full Knowledge database assisted gene marker selection for chronic lymphocytic leukemia
title_fullStr Knowledge database assisted gene marker selection for chronic lymphocytic leukemia
title_full_unstemmed Knowledge database assisted gene marker selection for chronic lymphocytic leukemia
title_short Knowledge database assisted gene marker selection for chronic lymphocytic leukemia
title_sort knowledge database assisted gene marker selection for chronic lymphocytic leukemia
topic Pre-Clinical Research Reports
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6134680/
https://www.ncbi.nlm.nih.gov/pubmed/29996709
http://dx.doi.org/10.1177/0300060518783072
work_keys_str_mv AT xiangxixi knowledgedatabaseassistedgenemarkerselectionforchroniclymphocyticleukemia
AT wangyuping knowledgedatabaseassistedgenemarkerselectionforchroniclymphocyticleukemia
AT caohongbao knowledgedatabaseassistedgenemarkerselectionforchroniclymphocyticleukemia
AT zhangxi knowledgedatabaseassistedgenemarkerselectionforchroniclymphocyticleukemia