Cargando…

Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data

Single-cell RNA sequencing (scRNA-seq) technologies allow researchers to uncover the biological states of a single cell at high resolution. For computational efficiency and easy visualization, dimensionality reduction is necessary to capture gene expression patterns in low-dimensional space. Here we...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Xiaoxiao, Liu, Yiwen, An, Lingling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7673125/
https://www.ncbi.nlm.nih.gov/pubmed/33203837
http://dx.doi.org/10.1038/s41467-020-19465-7
_version_ 1783611271663321088
author Sun, Xiaoxiao
Liu, Yiwen
An, Lingling
author_facet Sun, Xiaoxiao
Liu, Yiwen
An, Lingling
author_sort Sun, Xiaoxiao
collection PubMed
description Single-cell RNA sequencing (scRNA-seq) technologies allow researchers to uncover the biological states of a single cell at high resolution. For computational efficiency and easy visualization, dimensionality reduction is necessary to capture gene expression patterns in low-dimensional space. Here we propose an ensemble method for simultaneous dimensionality reduction and feature gene extraction (EDGE) of scRNA-seq data. Different from existing dimensionality reduction techniques, the proposed method implements an ensemble learning scheme that utilizes massive weak learners for an accurate similarity search. Based on the similarity matrix constructed by those weak learners, the low-dimensional embedding of the data is estimated and optimized through spectral embedding and stochastic gradient descent. Comprehensive simulation and empirical studies show that EDGE is well suited for searching for meaningful organization of cells, detecting rare cell types, and identifying essential feature genes associated with certain cell types.
format Online
Article
Text
id pubmed-7673125
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-76731252020-11-24 Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data Sun, Xiaoxiao Liu, Yiwen An, Lingling Nat Commun Article Single-cell RNA sequencing (scRNA-seq) technologies allow researchers to uncover the biological states of a single cell at high resolution. For computational efficiency and easy visualization, dimensionality reduction is necessary to capture gene expression patterns in low-dimensional space. Here we propose an ensemble method for simultaneous dimensionality reduction and feature gene extraction (EDGE) of scRNA-seq data. Different from existing dimensionality reduction techniques, the proposed method implements an ensemble learning scheme that utilizes massive weak learners for an accurate similarity search. Based on the similarity matrix constructed by those weak learners, the low-dimensional embedding of the data is estimated and optimized through spectral embedding and stochastic gradient descent. Comprehensive simulation and empirical studies show that EDGE is well suited for searching for meaningful organization of cells, detecting rare cell types, and identifying essential feature genes associated with certain cell types. Nature Publishing Group UK 2020-11-17 /pmc/articles/PMC7673125/ /pubmed/33203837 http://dx.doi.org/10.1038/s41467-020-19465-7 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Sun, Xiaoxiao
Liu, Yiwen
An, Lingling
Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data
title Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data
title_full Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data
title_fullStr Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data
title_full_unstemmed Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data
title_short Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data
title_sort ensemble dimensionality reduction and feature gene extraction for single-cell rna-seq data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7673125/
https://www.ncbi.nlm.nih.gov/pubmed/33203837
http://dx.doi.org/10.1038/s41467-020-19465-7
work_keys_str_mv AT sunxiaoxiao ensembledimensionalityreductionandfeaturegeneextractionforsinglecellrnaseqdata
AT liuyiwen ensembledimensionalityreductionandfeaturegeneextractionforsinglecellrnaseqdata
AT anlingling ensembledimensionalityreductionandfeaturegeneextractionforsinglecellrnaseqdata