Cargando…

An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification

Identifying the key genes related to tumors from gene expression data with a large number of features is important for the accurate classification of tumors and to make special treatment decisions. In recent years, unsupervised feature selection algorithms have attracted considerable attention in th...

Descripción completa

Detalles Bibliográficos
Autores principales: Jin, Bo, Fu, Chunling, Jin, Yong, Yang, Wei, Li, Shengbin, Zhang, Guangyao, Wang, Zheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8197094/
https://www.ncbi.nlm.nih.gov/pubmed/34071066
http://dx.doi.org/10.3390/s21113627
_version_ 1783706840133009408
author Jin, Bo
Fu, Chunling
Jin, Yong
Yang, Wei
Li, Shengbin
Zhang, Guangyao
Wang, Zheng
author_facet Jin, Bo
Fu, Chunling
Jin, Yong
Yang, Wei
Li, Shengbin
Zhang, Guangyao
Wang, Zheng
author_sort Jin, Bo
collection PubMed
description Identifying the key genes related to tumors from gene expression data with a large number of features is important for the accurate classification of tumors and to make special treatment decisions. In recent years, unsupervised feature selection algorithms have attracted considerable attention in the field of gene selection as they can find the most discriminating subsets of genes, namely the potential information in biological data. Recent research also shows that maintaining the important structure of data is necessary for gene selection. However, most current feature selection methods merely capture the local structure of the original data while ignoring the importance of the global structure of the original data. We believe that the global structure and local structure of the original data are equally important, and so the selected genes should maintain the essential structure of the original data as far as possible. In this paper, we propose a new, adaptive, unsupervised feature selection scheme which not only reconstructs high-dimensional data into a low-dimensional space with the constraint of feature distance invariance but also employs [Formula: see text]-norm to enable a matrix with the ability to perform gene selection embedding into the local manifold structure-learning framework. Moreover, an effective algorithm is developed to solve the optimization problem based on the proposed scheme. Comparative experiments with some classical schemes on real tumor datasets demonstrate the effectiveness of the proposed method.
format Online
Article
Text
id pubmed-8197094
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81970942021-06-13 An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification Jin, Bo Fu, Chunling Jin, Yong Yang, Wei Li, Shengbin Zhang, Guangyao Wang, Zheng Sensors (Basel) Article Identifying the key genes related to tumors from gene expression data with a large number of features is important for the accurate classification of tumors and to make special treatment decisions. In recent years, unsupervised feature selection algorithms have attracted considerable attention in the field of gene selection as they can find the most discriminating subsets of genes, namely the potential information in biological data. Recent research also shows that maintaining the important structure of data is necessary for gene selection. However, most current feature selection methods merely capture the local structure of the original data while ignoring the importance of the global structure of the original data. We believe that the global structure and local structure of the original data are equally important, and so the selected genes should maintain the essential structure of the original data as far as possible. In this paper, we propose a new, adaptive, unsupervised feature selection scheme which not only reconstructs high-dimensional data into a low-dimensional space with the constraint of feature distance invariance but also employs [Formula: see text]-norm to enable a matrix with the ability to perform gene selection embedding into the local manifold structure-learning framework. Moreover, an effective algorithm is developed to solve the optimization problem based on the proposed scheme. Comparative experiments with some classical schemes on real tumor datasets demonstrate the effectiveness of the proposed method. MDPI 2021-05-23 /pmc/articles/PMC8197094/ /pubmed/34071066 http://dx.doi.org/10.3390/s21113627 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jin, Bo
Fu, Chunling
Jin, Yong
Yang, Wei
Li, Shengbin
Zhang, Guangyao
Wang, Zheng
An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification
title An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification
title_full An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification
title_fullStr An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification
title_full_unstemmed An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification
title_short An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification
title_sort adaptive unsupervised feature selection algorithm based on mds for tumor gene data classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8197094/
https://www.ncbi.nlm.nih.gov/pubmed/34071066
http://dx.doi.org/10.3390/s21113627
work_keys_str_mv AT jinbo anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT fuchunling anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT jinyong anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT yangwei anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT lishengbin anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT zhangguangyao anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT wangzheng anadaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT jinbo adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT fuchunling adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT jinyong adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT yangwei adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT lishengbin adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT zhangguangyao adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification
AT wangzheng adaptiveunsupervisedfeatureselectionalgorithmbasedonmdsfortumorgenedataclassification