Cargando…
Imputation method for single-cell RNA-seq data using neural topic model
Single-cell RNA sequencing (scRNA-seq) technology studies transcriptome and cell-to-cell differences from higher single-cell resolution and different perspectives. Despite the advantage of high capture efficiency, downstream functional analysis of scRNA-seq data is made difficult by the excess of ze...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673642/ https://www.ncbi.nlm.nih.gov/pubmed/38000911 http://dx.doi.org/10.1093/gigascience/giad098 |
_version_ | 1785149626595147776 |
---|---|
author | Qi, Yueyang Han, Shuangkai Tang, Lin Liu, Lin |
author_facet | Qi, Yueyang Han, Shuangkai Tang, Lin Liu, Lin |
author_sort | Qi, Yueyang |
collection | PubMed |
description | Single-cell RNA sequencing (scRNA-seq) technology studies transcriptome and cell-to-cell differences from higher single-cell resolution and different perspectives. Despite the advantage of high capture efficiency, downstream functional analysis of scRNA-seq data is made difficult by the excess of zero values (i.e., the dropout phenomenon). To effectively address this problem, we introduced scNTImpute, an imputation framework based on a neural topic model. A neural network encoder is used to extract underlying topic features of single-cell transcriptome data to infer high-quality cell similarity. At the same time, we determine which transcriptome data are affected by the dropout phenomenon according to the learning of the mixture model by the neural network. On the basis of stable cell similarity, the same gene information in other similar cells is borrowed to impute only the missing expression values. By evaluating the performance of real data, scNTImpute can accurately and efficiently identify the dropout values and imputes them accurately. In the meantime, the clustering of cell subsets is improved and the original biological information in cell clustering is solved, which is covered by technical noise. The source code for the scNTImpute module is available as open source at https://github.com/qiyueyang-7/scNTImpute.git. |
format | Online Article Text |
id | pubmed-10673642 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106736422023-11-24 Imputation method for single-cell RNA-seq data using neural topic model Qi, Yueyang Han, Shuangkai Tang, Lin Liu, Lin Gigascience Technical Note Single-cell RNA sequencing (scRNA-seq) technology studies transcriptome and cell-to-cell differences from higher single-cell resolution and different perspectives. Despite the advantage of high capture efficiency, downstream functional analysis of scRNA-seq data is made difficult by the excess of zero values (i.e., the dropout phenomenon). To effectively address this problem, we introduced scNTImpute, an imputation framework based on a neural topic model. A neural network encoder is used to extract underlying topic features of single-cell transcriptome data to infer high-quality cell similarity. At the same time, we determine which transcriptome data are affected by the dropout phenomenon according to the learning of the mixture model by the neural network. On the basis of stable cell similarity, the same gene information in other similar cells is borrowed to impute only the missing expression values. By evaluating the performance of real data, scNTImpute can accurately and efficiently identify the dropout values and imputes them accurately. In the meantime, the clustering of cell subsets is improved and the original biological information in cell clustering is solved, which is covered by technical noise. The source code for the scNTImpute module is available as open source at https://github.com/qiyueyang-7/scNTImpute.git. Oxford University Press 2023-11-24 /pmc/articles/PMC10673642/ /pubmed/38000911 http://dx.doi.org/10.1093/gigascience/giad098 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Qi, Yueyang Han, Shuangkai Tang, Lin Liu, Lin Imputation method for single-cell RNA-seq data using neural topic model |
title | Imputation method for single-cell RNA-seq data using neural topic model |
title_full | Imputation method for single-cell RNA-seq data using neural topic model |
title_fullStr | Imputation method for single-cell RNA-seq data using neural topic model |
title_full_unstemmed | Imputation method for single-cell RNA-seq data using neural topic model |
title_short | Imputation method for single-cell RNA-seq data using neural topic model |
title_sort | imputation method for single-cell rna-seq data using neural topic model |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673642/ https://www.ncbi.nlm.nih.gov/pubmed/38000911 http://dx.doi.org/10.1093/gigascience/giad098 |
work_keys_str_mv | AT qiyueyang imputationmethodforsinglecellrnaseqdatausingneuraltopicmodel AT hanshuangkai imputationmethodforsinglecellrnaseqdatausingneuraltopicmodel AT tanglin imputationmethodforsinglecellrnaseqdatausingneuraltopicmodel AT liulin imputationmethodforsinglecellrnaseqdatausingneuraltopicmodel |