Cargando…

scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data

Technical variation in feature measurements, such as gene expression and locus accessibility, is a key challenge of large-scale single-cell genomic datasets. We show that this technical variation in both scRNA-seq and scATAC-seq datasets can be mitigated by analyzing feature detection patterns alone...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Ruoxin, Quon, Gerald
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6734238/
https://www.ncbi.nlm.nih.gov/pubmed/31500668
http://dx.doi.org/10.1186/s13059-019-1806-0
_version_ 1783450110896635904
author Li, Ruoxin
Quon, Gerald
author_facet Li, Ruoxin
Quon, Gerald
author_sort Li, Ruoxin
collection PubMed
description Technical variation in feature measurements, such as gene expression and locus accessibility, is a key challenge of large-scale single-cell genomic datasets. We show that this technical variation in both scRNA-seq and scATAC-seq datasets can be mitigated by analyzing feature detection patterns alone and ignoring feature quantification measurements. This result holds when datasets have low detection noise relative to quantification noise. We demonstrate state-of-the-art performance of detection pattern models using our new framework, scBFA, for both cell type identification and trajectory inference. Performance gains can also be realized in one line of R code in existing pipelines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-019-1806-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6734238
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-67342382019-09-12 scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data Li, Ruoxin Quon, Gerald Genome Biol Method Technical variation in feature measurements, such as gene expression and locus accessibility, is a key challenge of large-scale single-cell genomic datasets. We show that this technical variation in both scRNA-seq and scATAC-seq datasets can be mitigated by analyzing feature detection patterns alone and ignoring feature quantification measurements. This result holds when datasets have low detection noise relative to quantification noise. We demonstrate state-of-the-art performance of detection pattern models using our new framework, scBFA, for both cell type identification and trajectory inference. Performance gains can also be realized in one line of R code in existing pipelines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-019-1806-0) contains supplementary material, which is available to authorized users. BioMed Central 2019-09-09 /pmc/articles/PMC6734238/ /pubmed/31500668 http://dx.doi.org/10.1186/s13059-019-1806-0 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Method
Li, Ruoxin
Quon, Gerald
scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data
title scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data
title_full scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data
title_fullStr scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data
title_full_unstemmed scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data
title_short scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data
title_sort scbfa: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6734238/
https://www.ncbi.nlm.nih.gov/pubmed/31500668
http://dx.doi.org/10.1186/s13059-019-1806-0
work_keys_str_mv AT liruoxin scbfamodelingdetectionpatternstomitigatetechnicalnoiseinlargescalesinglecellgenomicsdata
AT quongerald scbfamodelingdetectionpatternstomitigatetechnicalnoiseinlargescalesinglecellgenomicsdata