Cargando…

Imputation method for single-cell RNA-seq data using neural topic model

Single-cell RNA sequencing (scRNA-seq) technology studies transcriptome and cell-to-cell differences from higher single-cell resolution and different perspectives. Despite the advantage of high capture efficiency, downstream functional analysis of scRNA-seq data is made difficult by the excess of ze...

Descripción completa

Detalles Bibliográficos
Autores principales: Qi, Yueyang, Han, Shuangkai, Tang, Lin, Liu, Lin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673642/
https://www.ncbi.nlm.nih.gov/pubmed/38000911
http://dx.doi.org/10.1093/gigascience/giad098
_version_ 1785149626595147776
author Qi, Yueyang
Han, Shuangkai
Tang, Lin
Liu, Lin
author_facet Qi, Yueyang
Han, Shuangkai
Tang, Lin
Liu, Lin
author_sort Qi, Yueyang
collection PubMed
description Single-cell RNA sequencing (scRNA-seq) technology studies transcriptome and cell-to-cell differences from higher single-cell resolution and different perspectives. Despite the advantage of high capture efficiency, downstream functional analysis of scRNA-seq data is made difficult by the excess of zero values (i.e., the dropout phenomenon). To effectively address this problem, we introduced scNTImpute, an imputation framework based on a neural topic model. A neural network encoder is used to extract underlying topic features of single-cell transcriptome data to infer high-quality cell similarity. At the same time, we determine which transcriptome data are affected by the dropout phenomenon according to the learning of the mixture model by the neural network. On the basis of stable cell similarity, the same gene information in other similar cells is borrowed to impute only the missing expression values. By evaluating the performance of real data, scNTImpute can accurately and efficiently identify the dropout values and imputes them accurately. In the meantime, the clustering of cell subsets is improved and the original biological information in cell clustering is solved, which is covered by technical noise. The source code for the scNTImpute module is available as open source at https://github.com/qiyueyang-7/scNTImpute.git.
format Online
Article
Text
id pubmed-10673642
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106736422023-11-24 Imputation method for single-cell RNA-seq data using neural topic model Qi, Yueyang Han, Shuangkai Tang, Lin Liu, Lin Gigascience Technical Note Single-cell RNA sequencing (scRNA-seq) technology studies transcriptome and cell-to-cell differences from higher single-cell resolution and different perspectives. Despite the advantage of high capture efficiency, downstream functional analysis of scRNA-seq data is made difficult by the excess of zero values (i.e., the dropout phenomenon). To effectively address this problem, we introduced scNTImpute, an imputation framework based on a neural topic model. A neural network encoder is used to extract underlying topic features of single-cell transcriptome data to infer high-quality cell similarity. At the same time, we determine which transcriptome data are affected by the dropout phenomenon according to the learning of the mixture model by the neural network. On the basis of stable cell similarity, the same gene information in other similar cells is borrowed to impute only the missing expression values. By evaluating the performance of real data, scNTImpute can accurately and efficiently identify the dropout values and imputes them accurately. In the meantime, the clustering of cell subsets is improved and the original biological information in cell clustering is solved, which is covered by technical noise. The source code for the scNTImpute module is available as open source at https://github.com/qiyueyang-7/scNTImpute.git. Oxford University Press 2023-11-24 /pmc/articles/PMC10673642/ /pubmed/38000911 http://dx.doi.org/10.1093/gigascience/giad098 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Qi, Yueyang
Han, Shuangkai
Tang, Lin
Liu, Lin
Imputation method for single-cell RNA-seq data using neural topic model
title Imputation method for single-cell RNA-seq data using neural topic model
title_full Imputation method for single-cell RNA-seq data using neural topic model
title_fullStr Imputation method for single-cell RNA-seq data using neural topic model
title_full_unstemmed Imputation method for single-cell RNA-seq data using neural topic model
title_short Imputation method for single-cell RNA-seq data using neural topic model
title_sort imputation method for single-cell rna-seq data using neural topic model
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673642/
https://www.ncbi.nlm.nih.gov/pubmed/38000911
http://dx.doi.org/10.1093/gigascience/giad098
work_keys_str_mv AT qiyueyang imputationmethodforsinglecellrnaseqdatausingneuraltopicmodel
AT hanshuangkai imputationmethodforsinglecellrnaseqdatausingneuraltopicmodel
AT tanglin imputationmethodforsinglecellrnaseqdatausingneuraltopicmodel
AT liulin imputationmethodforsinglecellrnaseqdatausingneuraltopicmodel