Cargando…

BASiCS: Bayesian Analysis of Single-Cell Sequencing Data

Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine hetero...

Descripción completa

Detalles Bibliográficos
Autores principales: Vallejos, Catalina A., Marioni, John C., Richardson, Sylvia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4480965/
https://www.ncbi.nlm.nih.gov/pubmed/26107944
http://dx.doi.org/10.1371/journal.pcbi.1004333
_version_ 1782378220567396352
author Vallejos, Catalina A.
Marioni, John C.
Richardson, Sylvia
author_facet Vallejos, Catalina A.
Marioni, John C.
Richardson, Sylvia
author_sort Vallejos, Catalina A.
collection PubMed
description Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell’s lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach.
format Online
Article
Text
id pubmed-4480965
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44809652015-06-29 BASiCS: Bayesian Analysis of Single-Cell Sequencing Data Vallejos, Catalina A. Marioni, John C. Richardson, Sylvia PLoS Comput Biol Research Article Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell’s lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated to high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable supports the efficacy of our approach. Public Library of Science 2015-06-24 /pmc/articles/PMC4480965/ /pubmed/26107944 http://dx.doi.org/10.1371/journal.pcbi.1004333 Text en © 2015 Vallejos et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Vallejos, Catalina A.
Marioni, John C.
Richardson, Sylvia
BASiCS: Bayesian Analysis of Single-Cell Sequencing Data
title BASiCS: Bayesian Analysis of Single-Cell Sequencing Data
title_full BASiCS: Bayesian Analysis of Single-Cell Sequencing Data
title_fullStr BASiCS: Bayesian Analysis of Single-Cell Sequencing Data
title_full_unstemmed BASiCS: Bayesian Analysis of Single-Cell Sequencing Data
title_short BASiCS: Bayesian Analysis of Single-Cell Sequencing Data
title_sort basics: bayesian analysis of single-cell sequencing data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4480965/
https://www.ncbi.nlm.nih.gov/pubmed/26107944
http://dx.doi.org/10.1371/journal.pcbi.1004333
work_keys_str_mv AT vallejoscatalinaa basicsbayesiananalysisofsinglecellsequencingdata
AT marionijohnc basicsbayesiananalysisofsinglecellsequencingdata
AT richardsonsylvia basicsbayesiananalysisofsinglecellsequencingdata