Cargando…

DrImpute: imputing dropout events in single cell RNA sequencing data

BACKGROUND: The single cell RNA sequencing (scRNA-seq) technique begin a new era by allowing the observation of gene expression at the single cell level. However, there is also a large amount of technical and biological noise. Because of the low number of RNA transcriptomes and the stochastic nature...

Descripción completa

Detalles Bibliográficos
Autores principales: Gong, Wuming, Kwak, Il-Youp, Pota, Pruthvi, Koyano-Nakagawa, Naoko, Garry, Daniel J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5994079/
https://www.ncbi.nlm.nih.gov/pubmed/29884114
http://dx.doi.org/10.1186/s12859-018-2226-y
_version_ 1783330352519970816
author Gong, Wuming
Kwak, Il-Youp
Pota, Pruthvi
Koyano-Nakagawa, Naoko
Garry, Daniel J.
author_facet Gong, Wuming
Kwak, Il-Youp
Pota, Pruthvi
Koyano-Nakagawa, Naoko
Garry, Daniel J.
author_sort Gong, Wuming
collection PubMed
description BACKGROUND: The single cell RNA sequencing (scRNA-seq) technique begin a new era by allowing the observation of gene expression at the single cell level. However, there is also a large amount of technical and biological noise. Because of the low number of RNA transcriptomes and the stochastic nature of the gene expression pattern, there is a high chance of missing nonzero entries as zero, which are called dropout events. RESULTS: We develop DrImpute to impute dropout events in scRNA-seq data. We show that DrImpute has significantly better performance on the separation of the dropout zeros from true zeros than existing imputation algorithms. We also demonstrate that DrImpute can significantly improve the performance of existing tools for clustering, visualization and lineage reconstruction of nine published scRNA-seq datasets. CONCLUSIONS: DrImpute can serve as a very useful addition to the currently existing statistical tools for single cell RNA-seq analysis. DrImpute is implemented in R and is available at https://github.com/gongx030/DrImpute. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2226-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5994079
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59940792018-06-21 DrImpute: imputing dropout events in single cell RNA sequencing data Gong, Wuming Kwak, Il-Youp Pota, Pruthvi Koyano-Nakagawa, Naoko Garry, Daniel J. BMC Bioinformatics Methodology Article BACKGROUND: The single cell RNA sequencing (scRNA-seq) technique begin a new era by allowing the observation of gene expression at the single cell level. However, there is also a large amount of technical and biological noise. Because of the low number of RNA transcriptomes and the stochastic nature of the gene expression pattern, there is a high chance of missing nonzero entries as zero, which are called dropout events. RESULTS: We develop DrImpute to impute dropout events in scRNA-seq data. We show that DrImpute has significantly better performance on the separation of the dropout zeros from true zeros than existing imputation algorithms. We also demonstrate that DrImpute can significantly improve the performance of existing tools for clustering, visualization and lineage reconstruction of nine published scRNA-seq datasets. CONCLUSIONS: DrImpute can serve as a very useful addition to the currently existing statistical tools for single cell RNA-seq analysis. DrImpute is implemented in R and is available at https://github.com/gongx030/DrImpute. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2226-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-08 /pmc/articles/PMC5994079/ /pubmed/29884114 http://dx.doi.org/10.1186/s12859-018-2226-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Gong, Wuming
Kwak, Il-Youp
Pota, Pruthvi
Koyano-Nakagawa, Naoko
Garry, Daniel J.
DrImpute: imputing dropout events in single cell RNA sequencing data
title DrImpute: imputing dropout events in single cell RNA sequencing data
title_full DrImpute: imputing dropout events in single cell RNA sequencing data
title_fullStr DrImpute: imputing dropout events in single cell RNA sequencing data
title_full_unstemmed DrImpute: imputing dropout events in single cell RNA sequencing data
title_short DrImpute: imputing dropout events in single cell RNA sequencing data
title_sort drimpute: imputing dropout events in single cell rna sequencing data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5994079/
https://www.ncbi.nlm.nih.gov/pubmed/29884114
http://dx.doi.org/10.1186/s12859-018-2226-y
work_keys_str_mv AT gongwuming drimputeimputingdropouteventsinsinglecellrnasequencingdata
AT kwakilyoup drimputeimputingdropouteventsinsinglecellrnasequencingdata
AT potapruthvi drimputeimputingdropouteventsinsinglecellrnasequencingdata
AT koyanonakagawanaoko drimputeimputingdropouteventsinsinglecellrnasequencingdata
AT garrydanielj drimputeimputingdropouteventsinsinglecellrnasequencingdata