Cargando…

A Variable-Clustering-Based Feature Selection to Improve Positive and Negative Discrimination of P53 Protein in Colorectal Cancer Patients

P53 protein tumor suppressor gene plays a guiding role in the treatment and prognosis of colorectal cancer (CRC). This paper aimed at proposing a feature selection method based on variable clustering to improve positive and negative discrimination of P53 protein in CRC patients. In this approach, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Luqing, Feng, Li, Wang, Jiasi, Li, Jie, Li, Hongbin, Zeng, Fanxin, Sun, Liangli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9691320/
https://www.ncbi.nlm.nih.gov/pubmed/36439951
http://dx.doi.org/10.1155/2022/9261713
_version_ 1784837014707765248
author Wang, Luqing
Feng, Li
Wang, Jiasi
Li, Jie
Li, Hongbin
Zeng, Fanxin
Sun, Liangli
author_facet Wang, Luqing
Feng, Li
Wang, Jiasi
Li, Jie
Li, Hongbin
Zeng, Fanxin
Sun, Liangli
author_sort Wang, Luqing
collection PubMed
description P53 protein tumor suppressor gene plays a guiding role in the treatment and prognosis of colorectal cancer (CRC). This paper aimed at proposing a feature selection method based on variable clustering to improve positive and negative discrimination of P53 protein in CRC patients. In this approach, we cluster the preprocessed dataset with variables, and then find the features with the largest information value (IV) for each cluster to form a feature subset. We call this method as IV_Cluster. In the actual medical data test, compared with the information value feature selection method, the accuracy of the 10-fold cross-validation logistic regression model increased by 4.4%, 2.0%, and 5.8%, and Kappa values increased by 21.8%, 8.6%, and 22.4%, respectively, under 5, 10, and 15 feature sets.
format Online
Article
Text
id pubmed-9691320
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-96913202022-11-25 A Variable-Clustering-Based Feature Selection to Improve Positive and Negative Discrimination of P53 Protein in Colorectal Cancer Patients Wang, Luqing Feng, Li Wang, Jiasi Li, Jie Li, Hongbin Zeng, Fanxin Sun, Liangli Comput Math Methods Med Research Article P53 protein tumor suppressor gene plays a guiding role in the treatment and prognosis of colorectal cancer (CRC). This paper aimed at proposing a feature selection method based on variable clustering to improve positive and negative discrimination of P53 protein in CRC patients. In this approach, we cluster the preprocessed dataset with variables, and then find the features with the largest information value (IV) for each cluster to form a feature subset. We call this method as IV_Cluster. In the actual medical data test, compared with the information value feature selection method, the accuracy of the 10-fold cross-validation logistic regression model increased by 4.4%, 2.0%, and 5.8%, and Kappa values increased by 21.8%, 8.6%, and 22.4%, respectively, under 5, 10, and 15 feature sets. Hindawi 2022-11-17 /pmc/articles/PMC9691320/ /pubmed/36439951 http://dx.doi.org/10.1155/2022/9261713 Text en Copyright © 2022 Luqing Wang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wang, Luqing
Feng, Li
Wang, Jiasi
Li, Jie
Li, Hongbin
Zeng, Fanxin
Sun, Liangli
A Variable-Clustering-Based Feature Selection to Improve Positive and Negative Discrimination of P53 Protein in Colorectal Cancer Patients
title A Variable-Clustering-Based Feature Selection to Improve Positive and Negative Discrimination of P53 Protein in Colorectal Cancer Patients
title_full A Variable-Clustering-Based Feature Selection to Improve Positive and Negative Discrimination of P53 Protein in Colorectal Cancer Patients
title_fullStr A Variable-Clustering-Based Feature Selection to Improve Positive and Negative Discrimination of P53 Protein in Colorectal Cancer Patients
title_full_unstemmed A Variable-Clustering-Based Feature Selection to Improve Positive and Negative Discrimination of P53 Protein in Colorectal Cancer Patients
title_short A Variable-Clustering-Based Feature Selection to Improve Positive and Negative Discrimination of P53 Protein in Colorectal Cancer Patients
title_sort variable-clustering-based feature selection to improve positive and negative discrimination of p53 protein in colorectal cancer patients
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9691320/
https://www.ncbi.nlm.nih.gov/pubmed/36439951
http://dx.doi.org/10.1155/2022/9261713
work_keys_str_mv AT wangluqing avariableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT fengli avariableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT wangjiasi avariableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT lijie avariableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT lihongbin avariableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT zengfanxin avariableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT sunliangli avariableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT wangluqing variableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT fengli variableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT wangjiasi variableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT lijie variableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT lihongbin variableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT zengfanxin variableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients
AT sunliangli variableclusteringbasedfeatureselectiontoimprovepositiveandnegativediscriminationofp53proteinincolorectalcancerpatients