Cargando…

A Variable-Clustering-Based Feature Selection to Improve Positive and Negative Discrimination of P53 Protein in Colorectal Cancer Patients

P53 protein tumor suppressor gene plays a guiding role in the treatment and prognosis of colorectal cancer (CRC). This paper aimed at proposing a feature selection method based on variable clustering to improve positive and negative discrimination of P53 protein in CRC patients. In this approach, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Luqing, Feng, Li, Wang, Jiasi, Li, Jie, Li, Hongbin, Zeng, Fanxin, Sun, Liangli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9691320/
https://www.ncbi.nlm.nih.gov/pubmed/36439951
http://dx.doi.org/10.1155/2022/9261713
Descripción
Sumario:P53 protein tumor suppressor gene plays a guiding role in the treatment and prognosis of colorectal cancer (CRC). This paper aimed at proposing a feature selection method based on variable clustering to improve positive and negative discrimination of P53 protein in CRC patients. In this approach, we cluster the preprocessed dataset with variables, and then find the features with the largest information value (IV) for each cluster to form a feature subset. We call this method as IV_Cluster. In the actual medical data test, compared with the information value feature selection method, the accuracy of the 10-fold cross-validation logistic regression model increased by 4.4%, 2.0%, and 5.8%, and Kappa values increased by 21.8%, 8.6%, and 22.4%, respectively, under 5, 10, and 15 feature sets.