Cargando…

Graph convolutional network-based feature selection for high-dimensional and low-sample size data

MOTIVATION: Feature selection is a powerful dimension reduction technique which selects a subset of relevant features for model construction. Numerous feature selection methods have been proposed, but most of them fail under the high-dimensional and low-sample size (HDLSS) setting due to the challen...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Can, Weiss, Scott T, Liu, Yang-Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126323/
https://www.ncbi.nlm.nih.gov/pubmed/37084264
http://dx.doi.org/10.1093/bioinformatics/btad135
_version_ 1785030217186672640
author Chen, Can
Weiss, Scott T
Liu, Yang-Yu
author_facet Chen, Can
Weiss, Scott T
Liu, Yang-Yu
author_sort Chen, Can
collection PubMed
description MOTIVATION: Feature selection is a powerful dimension reduction technique which selects a subset of relevant features for model construction. Numerous feature selection methods have been proposed, but most of them fail under the high-dimensional and low-sample size (HDLSS) setting due to the challenge of overfitting. RESULTS: We present a deep learning-based method—GRAph Convolutional nEtwork feature Selector (GRACES)—to select important features for HDLSS data. GRACES exploits latent relations between samples with various overfitting-reducing techniques to iteratively find a set of optimal features which gives rise to the greatest decreases in the optimization loss. We demonstrate that GRACES significantly outperforms other feature selection methods on both synthetic and real-world datasets. AVAILABILITY AND IMPLEMENTATION: The source code is publicly available at https://github.com/canc1993/graces.
format Online
Article
Text
id pubmed-10126323
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101263232023-04-26 Graph convolutional network-based feature selection for high-dimensional and low-sample size data Chen, Can Weiss, Scott T Liu, Yang-Yu Bioinformatics Original Paper MOTIVATION: Feature selection is a powerful dimension reduction technique which selects a subset of relevant features for model construction. Numerous feature selection methods have been proposed, but most of them fail under the high-dimensional and low-sample size (HDLSS) setting due to the challenge of overfitting. RESULTS: We present a deep learning-based method—GRAph Convolutional nEtwork feature Selector (GRACES)—to select important features for HDLSS data. GRACES exploits latent relations between samples with various overfitting-reducing techniques to iteratively find a set of optimal features which gives rise to the greatest decreases in the optimization loss. We demonstrate that GRACES significantly outperforms other feature selection methods on both synthetic and real-world datasets. AVAILABILITY AND IMPLEMENTATION: The source code is publicly available at https://github.com/canc1993/graces. Oxford University Press 2023-04-21 /pmc/articles/PMC10126323/ /pubmed/37084264 http://dx.doi.org/10.1093/bioinformatics/btad135 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Chen, Can
Weiss, Scott T
Liu, Yang-Yu
Graph convolutional network-based feature selection for high-dimensional and low-sample size data
title Graph convolutional network-based feature selection for high-dimensional and low-sample size data
title_full Graph convolutional network-based feature selection for high-dimensional and low-sample size data
title_fullStr Graph convolutional network-based feature selection for high-dimensional and low-sample size data
title_full_unstemmed Graph convolutional network-based feature selection for high-dimensional and low-sample size data
title_short Graph convolutional network-based feature selection for high-dimensional and low-sample size data
title_sort graph convolutional network-based feature selection for high-dimensional and low-sample size data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126323/
https://www.ncbi.nlm.nih.gov/pubmed/37084264
http://dx.doi.org/10.1093/bioinformatics/btad135
work_keys_str_mv AT chencan graphconvolutionalnetworkbasedfeatureselectionforhighdimensionalandlowsamplesizedata
AT weissscottt graphconvolutionalnetworkbasedfeatureselectionforhighdimensionalandlowsamplesizedata
AT liuyangyu graphconvolutionalnetworkbasedfeatureselectionforhighdimensionalandlowsamplesizedata