Cargando…

The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data

Feature selection plays an important role in improving the performance of classification or reducing the dimensionality of high-dimensional datasets, such as high-throughput genomics/proteomics data in bioinformatics. As a popular approach with computational efficiency and scalability, information t...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yangyang, Gao, Xiaoguang, Ru, Xinxin, Sun, Pengzhan, Wang, Jihan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10378569/
https://www.ncbi.nlm.nih.gov/pubmed/37509950
http://dx.doi.org/10.3390/e25071003
_version_ 1785079799931207680
author Wang, Yangyang
Gao, Xiaoguang
Ru, Xinxin
Sun, Pengzhan
Wang, Jihan
author_facet Wang, Yangyang
Gao, Xiaoguang
Ru, Xinxin
Sun, Pengzhan
Wang, Jihan
author_sort Wang, Yangyang
collection PubMed
description Feature selection plays an important role in improving the performance of classification or reducing the dimensionality of high-dimensional datasets, such as high-throughput genomics/proteomics data in bioinformatics. As a popular approach with computational efficiency and scalability, information theory has been widely incorporated into feature selection. In this study, we propose a unique weight-based feature selection (WBFS) algorithm that assesses selected features and candidate features to identify the key protein biomarkers for classifying lung cancer subtypes from The Cancer Proteome Atlas (TCPA) database and we further explored the survival analysis between selected biomarkers and subtypes of lung cancer. Results show good performance of the combination of our WBFS method and Bayesian network for mining potential biomarkers. These candidate signatures have valuable biological significance in tumor classification and patient survival analysis. Taken together, this study proposes the WBFS method that helps to explore candidate biomarkers from biomedical datasets and provides useful information for tumor diagnosis or therapy strategies.
format Online
Article
Text
id pubmed-10378569
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103785692023-07-29 The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data Wang, Yangyang Gao, Xiaoguang Ru, Xinxin Sun, Pengzhan Wang, Jihan Entropy (Basel) Article Feature selection plays an important role in improving the performance of classification or reducing the dimensionality of high-dimensional datasets, such as high-throughput genomics/proteomics data in bioinformatics. As a popular approach with computational efficiency and scalability, information theory has been widely incorporated into feature selection. In this study, we propose a unique weight-based feature selection (WBFS) algorithm that assesses selected features and candidate features to identify the key protein biomarkers for classifying lung cancer subtypes from The Cancer Proteome Atlas (TCPA) database and we further explored the survival analysis between selected biomarkers and subtypes of lung cancer. Results show good performance of the combination of our WBFS method and Bayesian network for mining potential biomarkers. These candidate signatures have valuable biological significance in tumor classification and patient survival analysis. Taken together, this study proposes the WBFS method that helps to explore candidate biomarkers from biomedical datasets and provides useful information for tumor diagnosis or therapy strategies. MDPI 2023-06-29 /pmc/articles/PMC10378569/ /pubmed/37509950 http://dx.doi.org/10.3390/e25071003 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Yangyang
Gao, Xiaoguang
Ru, Xinxin
Sun, Pengzhan
Wang, Jihan
The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data
title The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data
title_full The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data
title_fullStr The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data
title_full_unstemmed The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data
title_short The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data
title_sort weight-based feature selection (wbfs) algorithm classifies lung cancer subtypes using proteomic data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10378569/
https://www.ncbi.nlm.nih.gov/pubmed/37509950
http://dx.doi.org/10.3390/e25071003
work_keys_str_mv AT wangyangyang theweightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata
AT gaoxiaoguang theweightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata
AT ruxinxin theweightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata
AT sunpengzhan theweightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata
AT wangjihan theweightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata
AT wangyangyang weightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata
AT gaoxiaoguang weightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata
AT ruxinxin weightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata
AT sunpengzhan weightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata
AT wangjihan weightbasedfeatureselectionwbfsalgorithmclassifieslungcancersubtypesusingproteomicdata