Cargando…

Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis

BACKGROUND: Feature selection is an essential task in single-cell RNA-seq (scRNA-seq) data analysis and can be critical for gene dimension reduction and downstream analyses, such as gene marker identification and cell type classification. Most popular methods for feature selection from scRNA-seq dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Hao, Liu, Chunlei, Wagle, Manoj M., Yang, Pengyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10638755/
https://www.ncbi.nlm.nih.gov/pubmed/37950331
http://dx.doi.org/10.1186/s13059-023-03100-x
_version_ 1785133663957024768
author Huang, Hao
Liu, Chunlei
Wagle, Manoj M.
Yang, Pengyi
author_facet Huang, Hao
Liu, Chunlei
Wagle, Manoj M.
Yang, Pengyi
author_sort Huang, Hao
collection PubMed
description BACKGROUND: Feature selection is an essential task in single-cell RNA-seq (scRNA-seq) data analysis and can be critical for gene dimension reduction and downstream analyses, such as gene marker identification and cell type classification. Most popular methods for feature selection from scRNA-seq data are based on the concept of differential distribution wherein a statistical model is used to detect changes in gene expression among cell types. Recent development of deep learning-based feature selection methods provides an alternative approach compared to traditional differential distribution-based methods in that the importance of a gene is determined by neural networks. RESULTS: In this work, we explore the utility of various deep learning-based feature selection methods for scRNA-seq data analysis. We sample from Tabula Muris and Tabula Sapiens atlases to create scRNA-seq datasets with a range of data properties and evaluate the performance of traditional and deep learning-based feature selection methods for cell type classification, feature selection reproducibility and diversity, and computational time. CONCLUSIONS: Our study provides a reference for future development and application of deep learning-based feature selection methods for single-cell omics data analyses. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03100-x.
format Online
Article
Text
id pubmed-10638755
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-106387552023-11-11 Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis Huang, Hao Liu, Chunlei Wagle, Manoj M. Yang, Pengyi Genome Biol Research BACKGROUND: Feature selection is an essential task in single-cell RNA-seq (scRNA-seq) data analysis and can be critical for gene dimension reduction and downstream analyses, such as gene marker identification and cell type classification. Most popular methods for feature selection from scRNA-seq data are based on the concept of differential distribution wherein a statistical model is used to detect changes in gene expression among cell types. Recent development of deep learning-based feature selection methods provides an alternative approach compared to traditional differential distribution-based methods in that the importance of a gene is determined by neural networks. RESULTS: In this work, we explore the utility of various deep learning-based feature selection methods for scRNA-seq data analysis. We sample from Tabula Muris and Tabula Sapiens atlases to create scRNA-seq datasets with a range of data properties and evaluate the performance of traditional and deep learning-based feature selection methods for cell type classification, feature selection reproducibility and diversity, and computational time. CONCLUSIONS: Our study provides a reference for future development and application of deep learning-based feature selection methods for single-cell omics data analyses. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03100-x. BioMed Central 2023-11-10 /pmc/articles/PMC10638755/ /pubmed/37950331 http://dx.doi.org/10.1186/s13059-023-03100-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Huang, Hao
Liu, Chunlei
Wagle, Manoj M.
Yang, Pengyi
Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis
title Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis
title_full Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis
title_fullStr Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis
title_full_unstemmed Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis
title_short Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis
title_sort evaluation of deep learning-based feature selection for single-cell rna sequencing data analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10638755/
https://www.ncbi.nlm.nih.gov/pubmed/37950331
http://dx.doi.org/10.1186/s13059-023-03100-x
work_keys_str_mv AT huanghao evaluationofdeeplearningbasedfeatureselectionforsinglecellrnasequencingdataanalysis
AT liuchunlei evaluationofdeeplearningbasedfeatureselectionforsinglecellrnasequencingdataanalysis
AT waglemanojm evaluationofdeeplearningbasedfeatureselectionforsinglecellrnasequencingdataanalysis
AT yangpengyi evaluationofdeeplearningbasedfeatureselectionforsinglecellrnasequencingdataanalysis