Cargando…
Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis
BACKGROUND: Feature selection is an essential task in single-cell RNA-seq (scRNA-seq) data analysis and can be critical for gene dimension reduction and downstream analyses, such as gene marker identification and cell type classification. Most popular methods for feature selection from scRNA-seq dat...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10638755/ https://www.ncbi.nlm.nih.gov/pubmed/37950331 http://dx.doi.org/10.1186/s13059-023-03100-x |
_version_ | 1785133663957024768 |
---|---|
author | Huang, Hao Liu, Chunlei Wagle, Manoj M. Yang, Pengyi |
author_facet | Huang, Hao Liu, Chunlei Wagle, Manoj M. Yang, Pengyi |
author_sort | Huang, Hao |
collection | PubMed |
description | BACKGROUND: Feature selection is an essential task in single-cell RNA-seq (scRNA-seq) data analysis and can be critical for gene dimension reduction and downstream analyses, such as gene marker identification and cell type classification. Most popular methods for feature selection from scRNA-seq data are based on the concept of differential distribution wherein a statistical model is used to detect changes in gene expression among cell types. Recent development of deep learning-based feature selection methods provides an alternative approach compared to traditional differential distribution-based methods in that the importance of a gene is determined by neural networks. RESULTS: In this work, we explore the utility of various deep learning-based feature selection methods for scRNA-seq data analysis. We sample from Tabula Muris and Tabula Sapiens atlases to create scRNA-seq datasets with a range of data properties and evaluate the performance of traditional and deep learning-based feature selection methods for cell type classification, feature selection reproducibility and diversity, and computational time. CONCLUSIONS: Our study provides a reference for future development and application of deep learning-based feature selection methods for single-cell omics data analyses. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03100-x. |
format | Online Article Text |
id | pubmed-10638755 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-106387552023-11-11 Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis Huang, Hao Liu, Chunlei Wagle, Manoj M. Yang, Pengyi Genome Biol Research BACKGROUND: Feature selection is an essential task in single-cell RNA-seq (scRNA-seq) data analysis and can be critical for gene dimension reduction and downstream analyses, such as gene marker identification and cell type classification. Most popular methods for feature selection from scRNA-seq data are based on the concept of differential distribution wherein a statistical model is used to detect changes in gene expression among cell types. Recent development of deep learning-based feature selection methods provides an alternative approach compared to traditional differential distribution-based methods in that the importance of a gene is determined by neural networks. RESULTS: In this work, we explore the utility of various deep learning-based feature selection methods for scRNA-seq data analysis. We sample from Tabula Muris and Tabula Sapiens atlases to create scRNA-seq datasets with a range of data properties and evaluate the performance of traditional and deep learning-based feature selection methods for cell type classification, feature selection reproducibility and diversity, and computational time. CONCLUSIONS: Our study provides a reference for future development and application of deep learning-based feature selection methods for single-cell omics data analyses. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03100-x. BioMed Central 2023-11-10 /pmc/articles/PMC10638755/ /pubmed/37950331 http://dx.doi.org/10.1186/s13059-023-03100-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Huang, Hao Liu, Chunlei Wagle, Manoj M. Yang, Pengyi Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis |
title | Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis |
title_full | Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis |
title_fullStr | Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis |
title_full_unstemmed | Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis |
title_short | Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis |
title_sort | evaluation of deep learning-based feature selection for single-cell rna sequencing data analysis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10638755/ https://www.ncbi.nlm.nih.gov/pubmed/37950331 http://dx.doi.org/10.1186/s13059-023-03100-x |
work_keys_str_mv | AT huanghao evaluationofdeeplearningbasedfeatureselectionforsinglecellrnasequencingdataanalysis AT liuchunlei evaluationofdeeplearningbasedfeatureselectionforsinglecellrnasequencingdataanalysis AT waglemanojm evaluationofdeeplearningbasedfeatureselectionforsinglecellrnasequencingdataanalysis AT yangpengyi evaluationofdeeplearningbasedfeatureselectionforsinglecellrnasequencingdataanalysis |