Cargando…

AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders

Motivation: The emergence of single-cell RNA sequencing (scRNA-seq) technology has paved the way for measuring RNA levels at single-cell resolution to study precise biological functions. However, the presence of a large number of missing values in its data will affect downstream analysis. This paper...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Li, Xu, Yin, Xue, Tong, Zhang, Xinyu, Li, Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8456123/
https://www.ncbi.nlm.nih.gov/pubmed/34567089
http://dx.doi.org/10.3389/fgene.2021.739677
_version_ 1784570814497030144
author Xu, Li
Xu, Yin
Xue, Tong
Zhang, Xinyu
Li, Jin
author_facet Xu, Li
Xu, Yin
Xue, Tong
Zhang, Xinyu
Li, Jin
author_sort Xu, Li
collection PubMed
description Motivation: The emergence of single-cell RNA sequencing (scRNA-seq) technology has paved the way for measuring RNA levels at single-cell resolution to study precise biological functions. However, the presence of a large number of missing values in its data will affect downstream analysis. This paper presents AdImpute: an imputation method based on semi-supervised autoencoders. The method uses another imputation method (DrImpute is used as an example) to fill the results as imputation weights of the autoencoder, and applies the cost function with imputation weights to learn the latent information in the data to achieve more accurate imputation. Results: As shown in clustering experiments with the simulated data sets and the real data sets, AdImpute is more accurate than other four publicly available scRNA-seq imputation methods, and minimally modifies the biologically silent genes. Overall, AdImpute is an accurate and robust imputation method.
format Online
Article
Text
id pubmed-8456123
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84561232021-09-23 AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders Xu, Li Xu, Yin Xue, Tong Zhang, Xinyu Li, Jin Front Genet Genetics Motivation: The emergence of single-cell RNA sequencing (scRNA-seq) technology has paved the way for measuring RNA levels at single-cell resolution to study precise biological functions. However, the presence of a large number of missing values in its data will affect downstream analysis. This paper presents AdImpute: an imputation method based on semi-supervised autoencoders. The method uses another imputation method (DrImpute is used as an example) to fill the results as imputation weights of the autoencoder, and applies the cost function with imputation weights to learn the latent information in the data to achieve more accurate imputation. Results: As shown in clustering experiments with the simulated data sets and the real data sets, AdImpute is more accurate than other four publicly available scRNA-seq imputation methods, and minimally modifies the biologically silent genes. Overall, AdImpute is an accurate and robust imputation method. Frontiers Media S.A. 2021-09-08 /pmc/articles/PMC8456123/ /pubmed/34567089 http://dx.doi.org/10.3389/fgene.2021.739677 Text en Copyright © 2021 Xu, Xu, Xue, Zhang and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xu, Li
Xu, Yin
Xue, Tong
Zhang, Xinyu
Li, Jin
AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders
title AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders
title_full AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders
title_fullStr AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders
title_full_unstemmed AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders
title_short AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders
title_sort adimpute: an imputation method for single-cell rna-seq data based on semi-supervised autoencoders
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8456123/
https://www.ncbi.nlm.nih.gov/pubmed/34567089
http://dx.doi.org/10.3389/fgene.2021.739677
work_keys_str_mv AT xuli adimputeanimputationmethodforsinglecellrnaseqdatabasedonsemisupervisedautoencoders
AT xuyin adimputeanimputationmethodforsinglecellrnaseqdatabasedonsemisupervisedautoencoders
AT xuetong adimputeanimputationmethodforsinglecellrnaseqdatabasedonsemisupervisedautoencoders
AT zhangxinyu adimputeanimputationmethodforsinglecellrnaseqdatabasedonsemisupervisedautoencoders
AT lijin adimputeanimputationmethodforsinglecellrnaseqdatabasedonsemisupervisedautoencoders