Cargando…
An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification
Identifying the key genes related to tumors from gene expression data with a large number of features is important for the accurate classification of tumors and to make special treatment decisions. In recent years, unsupervised feature selection algorithms have attracted considerable attention in th...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8197094/ https://www.ncbi.nlm.nih.gov/pubmed/34071066 http://dx.doi.org/10.3390/s21113627 |
_version_ | 1783706840133009408 |
---|---|
author | Jin, Bo Fu, Chunling Jin, Yong Yang, Wei Li, Shengbin Zhang, Guangyao Wang, Zheng |
author_facet | Jin, Bo Fu, Chunling Jin, Yong Yang, Wei Li, Shengbin Zhang, Guangyao Wang, Zheng |
author_sort | Jin, Bo |
collection | PubMed |
description | Identifying the key genes related to tumors from gene expression data with a large number of features is important for the accurate classification of tumors and to make special treatment decisions. In recent years, unsupervised feature selection algorithms have attracted considerable attention in the field of gene selection as they can find the most discriminating subsets of genes, namely the potential information in biological data. Recent research also shows that maintaining the important structure of data is necessary for gene selection. However, most current feature selection methods merely capture the local structure of the original data while ignoring the importance of the global structure of the original data. We believe that the global structure and local structure of the original data are equally important, and so the selected genes should maintain the essential structure of the original data as far as possible. In this paper, we propose a new, adaptive, unsupervised feature selection scheme which not only reconstructs high-dimensional data into a low-dimensional space with the constraint of feature distance invariance but also employs [Formula: see text]-norm to enable a matrix with the ability to perform gene selection embedding into the local manifold structure-learning framework. Moreover, an effective algorithm is developed to solve the optimization problem based on the proposed scheme. Comparative experiments with some classical schemes on real tumor datasets demonstrate the effectiveness of the proposed method. |
format | Online Article Text |
id | pubmed-8197094 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-81970942021-06-13 An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification Jin, Bo Fu, Chunling Jin, Yong Yang, Wei Li, Shengbin Zhang, Guangyao Wang, Zheng Sensors (Basel) Article Identifying the key genes related to tumors from gene expression data with a large number of features is important for the accurate classification of tumors and to make special treatment decisions. In recent years, unsupervised feature selection algorithms have attracted considerable attention in the field of gene selection as they can find the most discriminating subsets of genes, namely the potential information in biological data. Recent research also shows that maintaining the important structure of data is necessary for gene selection. However, most current feature selection methods merely capture the local structure of the original data while ignoring the importance of the global structure of the original data. We believe that the global structure and local structure of the original data are equally important, and so the selected genes should maintain the essential structure of the original data as far as possible. In this paper, we propose a new, adaptive, unsupervised feature selection scheme which not only reconstructs high-dimensional data into a low-dimensional space with the constraint of feature distance invariance but also employs [Formula: see text]-norm to enable a matrix with the ability to perform gene selection embedding into the local manifold structure-learning framework. Moreover, an effective algorithm is developed to solve the optimization problem based on the proposed scheme. Comparative experiments with some classical schemes on real tumor datasets demonstrate the effectiveness of the proposed method. MDPI 2021-05-23 /pmc/articles/PMC8197094/ /pubmed/34071066 http://dx.doi.org/10.3390/s21113627 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Jin, Bo Fu, Chunling Jin, Yong Yang, Wei Li, Shengbin Zhang, Guangyao Wang, Zheng An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification |
title | An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification |
title_full | An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification |
title_fullStr | An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification |
title_full_unstemmed | An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification |
title_short | An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification |
title_sort | adaptive unsupervised feature selection algorithm based on mds for tumor gene data classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8197094/ https://www.ncbi.nlm.nih.gov/pubmed/34071066 http://dx.doi.org/10.3390/s21113627 |
work_keys_str_mv | AT jinbo anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT fuchunling anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT jinyong anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT yangwei anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT lishengbin anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT zhangguangyao anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT wangzheng anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT jinbo adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT fuchunling adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT jinyong adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT yangwei adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT lishengbin adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT zhangguangyao adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification AT wangzheng adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification |