The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data
Feature selection plays an important role in improving the performance of classification or reducing the dimensionality of high-dimensional datasets, such as high-throughput genomics/proteomics data in bioinformatics. As a popular approach with computational efficiency and scalability, information t...
Main authors: | Wang, Yangyang; Gao, Xiaoguang; Ru, Xinxin; Sun, Pengzhan; Wang, Jihan |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10378569/ https://www.ncbi.nlm.nih.gov/pubmed/37509950 http://dx.doi.org/10.3390/e25071003 |
_version_ | 1785079799931207680 |
---|---|
author | Wang, Yangyang Gao, Xiaoguang Ru, Xinxin Sun, Pengzhan Wang, Jihan |
collection | PubMed |
description | Feature selection plays an important role in improving classification performance or reducing the dimensionality of high-dimensional datasets, such as high-throughput genomics/proteomics data in bioinformatics. As a popular approach with computational efficiency and scalability, information theory has been widely incorporated into feature selection. In this study, we propose a unique weight-based feature selection (WBFS) algorithm that assesses selected features and candidate features to identify the key protein biomarkers for classifying lung cancer subtypes from The Cancer Proteome Atlas (TCPA) database. We further explore survival analysis relating the selected biomarkers to lung cancer subtypes. The results show that combining our WBFS method with a Bayesian network performs well for mining potential biomarkers. These candidate signatures have valuable biological significance in tumor classification and patient survival analysis. Taken together, this study proposes the WBFS method, which helps to explore candidate biomarkers from biomedical datasets and provides useful information for tumor diagnosis or therapy strategies. |
format | Online Article Text |
id | pubmed-10378569 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-10378569 2023-07-29 Entropy (Basel) Article MDPI 2023-06-29 /pmc/articles/PMC10378569/ /pubmed/37509950 http://dx.doi.org/10.3390/e25071003 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
title | The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10378569/ https://www.ncbi.nlm.nih.gov/pubmed/37509950 http://dx.doi.org/10.3390/e25071003 |