Cargando…

Batch effects and the effective design of single-cell gene expression studies

Single-cell RNA sequencing (scRNA-seq) can be used to characterize variation in gene expression levels at high resolution. However, the sources of experimental noise in scRNA-seq are not yet well understood. We investigated the technical variation associated with sample processing using the single-c...

Descripción completa

Detalles Bibliográficos
Autores principales: Tung, Po-Yuan, Blischak, John D., Hsiao, Chiaowen Joyce, Knowles, David A., Burnett, Jonathan E., Pritchard, Jonathan K., Gilad, Yoav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5206706/
https://www.ncbi.nlm.nih.gov/pubmed/28045081
http://dx.doi.org/10.1038/srep39921
_version_ 1782490286942846976
author Tung, Po-Yuan
Blischak, John D.
Hsiao, Chiaowen Joyce
Knowles, David A.
Burnett, Jonathan E.
Pritchard, Jonathan K.
Gilad, Yoav
author_facet Tung, Po-Yuan
Blischak, John D.
Hsiao, Chiaowen Joyce
Knowles, David A.
Burnett, Jonathan E.
Pritchard, Jonathan K.
Gilad, Yoav
author_sort Tung, Po-Yuan
collection PubMed
description Single-cell RNA sequencing (scRNA-seq) can be used to characterize variation in gene expression levels at high resolution. However, the sources of experimental noise in scRNA-seq are not yet well understood. We investigated the technical variation associated with sample processing using the single-cell Fluidigm C1 platform. To do so, we processed three C1 replicates from three human induced pluripotent stem cell (iPSC) lines. We added unique molecular identifiers (UMIs) to all samples, to account for amplification bias. We found that the major source of variation in the gene expression data was driven by genotype, but we also observed substantial variation between the technical replicates. We observed that the conversion of reads to molecules using the UMIs was impacted by both biological and technical variation, indicating that UMI counts are not an unbiased estimator of gene expression levels. Based on our results, we suggest a framework for effective scRNA-seq studies.
format Online
Article
Text
id pubmed-5206706
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-52067062017-01-04 Batch effects and the effective design of single-cell gene expression studies Tung, Po-Yuan Blischak, John D. Hsiao, Chiaowen Joyce Knowles, David A. Burnett, Jonathan E. Pritchard, Jonathan K. Gilad, Yoav Sci Rep Article Single-cell RNA sequencing (scRNA-seq) can be used to characterize variation in gene expression levels at high resolution. However, the sources of experimental noise in scRNA-seq are not yet well understood. We investigated the technical variation associated with sample processing using the single-cell Fluidigm C1 platform. To do so, we processed three C1 replicates from three human induced pluripotent stem cell (iPSC) lines. We added unique molecular identifiers (UMIs) to all samples, to account for amplification bias. We found that the major source of variation in the gene expression data was driven by genotype, but we also observed substantial variation between the technical replicates. We observed that the conversion of reads to molecules using the UMIs was impacted by both biological and technical variation, indicating that UMI counts are not an unbiased estimator of gene expression levels. Based on our results, we suggest a framework for effective scRNA-seq studies. Nature Publishing Group 2017-01-03 /pmc/articles/PMC5206706/ /pubmed/28045081 http://dx.doi.org/10.1038/srep39921 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Tung, Po-Yuan
Blischak, John D.
Hsiao, Chiaowen Joyce
Knowles, David A.
Burnett, Jonathan E.
Pritchard, Jonathan K.
Gilad, Yoav
Batch effects and the effective design of single-cell gene expression studies
title Batch effects and the effective design of single-cell gene expression studies
title_full Batch effects and the effective design of single-cell gene expression studies
title_fullStr Batch effects and the effective design of single-cell gene expression studies
title_full_unstemmed Batch effects and the effective design of single-cell gene expression studies
title_short Batch effects and the effective design of single-cell gene expression studies
title_sort batch effects and the effective design of single-cell gene expression studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5206706/
https://www.ncbi.nlm.nih.gov/pubmed/28045081
http://dx.doi.org/10.1038/srep39921
work_keys_str_mv AT tungpoyuan batcheffectsandtheeffectivedesignofsinglecellgeneexpressionstudies
AT blischakjohnd batcheffectsandtheeffectivedesignofsinglecellgeneexpressionstudies
AT hsiaochiaowenjoyce batcheffectsandtheeffectivedesignofsinglecellgeneexpressionstudies
AT knowlesdavida batcheffectsandtheeffectivedesignofsinglecellgeneexpressionstudies
AT burnettjonathane batcheffectsandtheeffectivedesignofsinglecellgeneexpressionstudies
AT pritchardjonathank batcheffectsandtheeffectivedesignofsinglecellgeneexpressionstudies
AT giladyoav batcheffectsandtheeffectivedesignofsinglecellgeneexpressionstudies