Cargando…

McImpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data

Motivation: Single-cell RNA sequencing has been proved to be revolutionary for its potential of zooming into complex biological systems. Genome-wide expression analysis at single-cell resolution provides a window into dynamics of cellular phenotypes. This facilitates the characterization of transcri...

Descripción completa

Detalles Bibliográficos
Autores principales: Mongia, Aanchal, Sengupta, Debarka, Majumdar, Angshul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6361810/
https://www.ncbi.nlm.nih.gov/pubmed/30761179
http://dx.doi.org/10.3389/fgene.2019.00009
_version_ 1783392745342107648
author Mongia, Aanchal
Sengupta, Debarka
Majumdar, Angshul
author_facet Mongia, Aanchal
Sengupta, Debarka
Majumdar, Angshul
author_sort Mongia, Aanchal
collection PubMed
description Motivation: Single-cell RNA sequencing has been proved to be revolutionary for its potential of zooming into complex biological systems. Genome-wide expression analysis at single-cell resolution provides a window into dynamics of cellular phenotypes. This facilitates the characterization of transcriptional heterogeneity in normal and diseased tissues under various conditions. It also sheds light on the development or emergence of specific cell populations and phenotypes. However, owing to the paucity of input RNA, a typical single cell RNA sequencing data features a high number of dropout events where transcripts fail to get amplified. Results: We introduce mcImpute, a low-rank matrix completion based technique to impute dropouts in single cell expression data. On a number of real datasets, application of mcImpute yields significant improvements in the separation of true zeros from dropouts, cell-clustering, differential expression analysis, cell type separability, the performance of dimensionality reduction techniques for cell visualization, and gene distribution. Availability and Implementation: https://github.com/aanchalMongia/McImpute_scRNAseq
format Online
Article
Text
id pubmed-6361810
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-63618102019-02-13 McImpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data Mongia, Aanchal Sengupta, Debarka Majumdar, Angshul Front Genet Genetics Motivation: Single-cell RNA sequencing has been proved to be revolutionary for its potential of zooming into complex biological systems. Genome-wide expression analysis at single-cell resolution provides a window into dynamics of cellular phenotypes. This facilitates the characterization of transcriptional heterogeneity in normal and diseased tissues under various conditions. It also sheds light on the development or emergence of specific cell populations and phenotypes. However, owing to the paucity of input RNA, a typical single cell RNA sequencing data features a high number of dropout events where transcripts fail to get amplified. Results: We introduce mcImpute, a low-rank matrix completion based technique to impute dropouts in single cell expression data. On a number of real datasets, application of mcImpute yields significant improvements in the separation of true zeros from dropouts, cell-clustering, differential expression analysis, cell type separability, the performance of dimensionality reduction techniques for cell visualization, and gene distribution. Availability and Implementation: https://github.com/aanchalMongia/McImpute_scRNAseq Frontiers Media S.A. 2019-01-29 /pmc/articles/PMC6361810/ /pubmed/30761179 http://dx.doi.org/10.3389/fgene.2019.00009 Text en Copyright © 2019 Mongia, Sengupta and Majumdar. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Mongia, Aanchal
Sengupta, Debarka
Majumdar, Angshul
McImpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data
title McImpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data
title_full McImpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data
title_fullStr McImpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data
title_full_unstemmed McImpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data
title_short McImpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data
title_sort mcimpute: matrix completion based imputation for single cell rna-seq data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6361810/
https://www.ncbi.nlm.nih.gov/pubmed/30761179
http://dx.doi.org/10.3389/fgene.2019.00009
work_keys_str_mv AT mongiaaanchal mcimputematrixcompletionbasedimputationforsinglecellrnaseqdata
AT senguptadebarka mcimputematrixcompletionbasedimputationforsinglecellrnaseqdata
AT majumdarangshul mcimputematrixcompletionbasedimputationforsinglecellrnaseqdata