Cargando…
Graph convolutional network-based feature selection for high-dimensional and low-sample size data
MOTIVATION: Feature selection is a powerful dimension reduction technique which selects a subset of relevant features for model construction. Numerous feature selection methods have been proposed, but most of them fail under the high-dimensional and low-sample size (HDLSS) setting due to the challen...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126323/ https://www.ncbi.nlm.nih.gov/pubmed/37084264 http://dx.doi.org/10.1093/bioinformatics/btad135 |
_version_ | 1785030217186672640 |
---|---|
author | Chen, Can Weiss, Scott T Liu, Yang-Yu |
author_facet | Chen, Can Weiss, Scott T Liu, Yang-Yu |
author_sort | Chen, Can |
collection | PubMed |
description | MOTIVATION: Feature selection is a powerful dimension reduction technique which selects a subset of relevant features for model construction. Numerous feature selection methods have been proposed, but most of them fail under the high-dimensional and low-sample size (HDLSS) setting due to the challenge of overfitting. RESULTS: We present a deep learning-based method—GRAph Convolutional nEtwork feature Selector (GRACES)—to select important features for HDLSS data. GRACES exploits latent relations between samples with various overfitting-reducing techniques to iteratively find a set of optimal features which gives rise to the greatest decreases in the optimization loss. We demonstrate that GRACES significantly outperforms other feature selection methods on both synthetic and real-world datasets. AVAILABILITY AND IMPLEMENTATION: The source code is publicly available at https://github.com/canc1993/graces. |
format | Online Article Text |
id | pubmed-10126323 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101263232023-04-26 Graph convolutional network-based feature selection for high-dimensional and low-sample size data Chen, Can Weiss, Scott T Liu, Yang-Yu Bioinformatics Original Paper MOTIVATION: Feature selection is a powerful dimension reduction technique which selects a subset of relevant features for model construction. Numerous feature selection methods have been proposed, but most of them fail under the high-dimensional and low-sample size (HDLSS) setting due to the challenge of overfitting. RESULTS: We present a deep learning-based method—GRAph Convolutional nEtwork feature Selector (GRACES)—to select important features for HDLSS data. GRACES exploits latent relations between samples with various overfitting-reducing techniques to iteratively find a set of optimal features which gives rise to the greatest decreases in the optimization loss. We demonstrate that GRACES significantly outperforms other feature selection methods on both synthetic and real-world datasets. AVAILABILITY AND IMPLEMENTATION: The source code is publicly available at https://github.com/canc1993/graces. Oxford University Press 2023-04-21 /pmc/articles/PMC10126323/ /pubmed/37084264 http://dx.doi.org/10.1093/bioinformatics/btad135 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Chen, Can Weiss, Scott T Liu, Yang-Yu Graph convolutional network-based feature selection for high-dimensional and low-sample size data |
title | Graph convolutional network-based feature selection for high-dimensional and low-sample size data |
title_full | Graph convolutional network-based feature selection for high-dimensional and low-sample size data |
title_fullStr | Graph convolutional network-based feature selection for high-dimensional and low-sample size data |
title_full_unstemmed | Graph convolutional network-based feature selection for high-dimensional and low-sample size data |
title_short | Graph convolutional network-based feature selection for high-dimensional and low-sample size data |
title_sort | graph convolutional network-based feature selection for high-dimensional and low-sample size data |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126323/ https://www.ncbi.nlm.nih.gov/pubmed/37084264 http://dx.doi.org/10.1093/bioinformatics/btad135 |
work_keys_str_mv | AT chencan graphconvolutionalnetworkbasedfeatureselectionforhighdimensionalandlowsamplesizedata AT weissscottt graphconvolutionalnetworkbasedfeatureselectionforhighdimensionalandlowsamplesizedata AT liuyangyu graphconvolutionalnetworkbasedfeatureselectionforhighdimensionalandlowsamplesizedata |