Cargando…

Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction

Despite their widespread applications, single-cell RNA-sequencing (scRNA-seq) experiments are still plagued by batch effects and dropout events. Although the completely randomized experimental design has frequently been advocated to control for batch effects, it is rarely implemented in real applica...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Fangda, Chan, Ga Ming Angus, Wei, Yingying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7330047/
https://www.ncbi.nlm.nih.gov/pubmed/32612268
http://dx.doi.org/10.1038/s41467-020-16905-2
_version_ 1783553027379036160
author Song, Fangda
Chan, Ga Ming Angus
Wei, Yingying
author_facet Song, Fangda
Chan, Ga Ming Angus
Wei, Yingying
author_sort Song, Fangda
collection PubMed
description Despite their widespread applications, single-cell RNA-sequencing (scRNA-seq) experiments are still plagued by batch effects and dropout events. Although the completely randomized experimental design has frequently been advocated to control for batch effects, it is rarely implemented in real applications due to time and budget constraints. Here, we mathematically prove that under two more flexible and realistic experimental designs—the reference panel and the chain-type designs—true biological variability can also be separated from batch effects. We develop Batch effects correction with Unknown Subtypes for scRNA-seq data (BUSseq), which is an interpretable Bayesian hierarchical model that closely follows the data-generating mechanism of scRNA-seq experiments. BUSseq can simultaneously correct batch effects, cluster cell types, impute missing data caused by dropout events, and detect differentially expressed genes without requiring a preliminary normalization step. We demonstrate that BUSseq outperforms existing methods with simulated and real data.
format Online
Article
Text
id pubmed-7330047
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73300472020-07-06 Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction Song, Fangda Chan, Ga Ming Angus Wei, Yingying Nat Commun Article Despite their widespread applications, single-cell RNA-sequencing (scRNA-seq) experiments are still plagued by batch effects and dropout events. Although the completely randomized experimental design has frequently been advocated to control for batch effects, it is rarely implemented in real applications due to time and budget constraints. Here, we mathematically prove that under two more flexible and realistic experimental designs—the reference panel and the chain-type designs—true biological variability can also be separated from batch effects. We develop Batch effects correction with Unknown Subtypes for scRNA-seq data (BUSseq), which is an interpretable Bayesian hierarchical model that closely follows the data-generating mechanism of scRNA-seq experiments. BUSseq can simultaneously correct batch effects, cluster cell types, impute missing data caused by dropout events, and detect differentially expressed genes without requiring a preliminary normalization step. We demonstrate that BUSseq outperforms existing methods with simulated and real data. Nature Publishing Group UK 2020-07-01 /pmc/articles/PMC7330047/ /pubmed/32612268 http://dx.doi.org/10.1038/s41467-020-16905-2 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Song, Fangda
Chan, Ga Ming Angus
Wei, Yingying
Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction
title Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction
title_full Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction
title_fullStr Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction
title_full_unstemmed Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction
title_short Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction
title_sort flexible experimental designs for valid single-cell rna-sequencing experiments allowing batch effects correction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7330047/
https://www.ncbi.nlm.nih.gov/pubmed/32612268
http://dx.doi.org/10.1038/s41467-020-16905-2
work_keys_str_mv AT songfangda flexibleexperimentaldesignsforvalidsinglecellrnasequencingexperimentsallowingbatcheffectscorrection
AT changamingangus flexibleexperimentaldesignsforvalidsinglecellrnasequencingexperimentsallowingbatcheffectscorrection
AT weiyingying flexibleexperimentaldesignsforvalidsinglecellrnasequencingexperimentsallowingbatcheffectscorrection