Cargando…

scDC: single cell differential composition analysis

BACKGROUND: Differences in cell-type composition across subjects and conditions often carry biological significance. Recent advancements in single cell sequencing technologies enable cell-types to be identified at the single cell level, and as a result, cell-type composition of tissues can now be st...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Yue, Lin, Yingxin, Ormerod, John T., Yang, Pengyi, Yang, Jean Y.H., Lo, Kitty K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929335/
https://www.ncbi.nlm.nih.gov/pubmed/31870280
http://dx.doi.org/10.1186/s12859-019-3211-9
_version_ 1783482680286904320
author Cao, Yue
Lin, Yingxin
Ormerod, John T.
Yang, Pengyi
Yang, Jean Y.H.
Lo, Kitty K.
author_facet Cao, Yue
Lin, Yingxin
Ormerod, John T.
Yang, Pengyi
Yang, Jean Y.H.
Lo, Kitty K.
author_sort Cao, Yue
collection PubMed
description BACKGROUND: Differences in cell-type composition across subjects and conditions often carry biological significance. Recent advancements in single cell sequencing technologies enable cell-types to be identified at the single cell level, and as a result, cell-type composition of tissues can now be studied in exquisite detail. However, a number of challenges remain with cell-type composition analysis – none of the existing methods can identify cell-type perfectly and variability related to cell sampling exists in any single cell experiment. This necessitates the development of method for estimating uncertainty in cell-type composition. RESULTS: We developed a novel single cell differential composition (scDC) analysis method that performs differential cell-type composition analysis via bootstrap resampling. scDC captures the uncertainty associated with cell-type proportions of each subject via bias-corrected and accelerated bootstrap confidence intervals. We assessed the performance of our method using a number of simulated datasets and synthetic datasets curated from publicly available single cell datasets. In simulated datasets, scDC correctly recovered the true cell-type proportions. In synthetic datasets, the cell-type compositions returned by scDC were highly concordant with reference cell-type compositions from the original data. Since the majority of datasets tested in this study have only 2 to 5 subjects per condition, the addition of confidence intervals enabled better comparisons of compositional differences between subjects and across conditions. CONCLUSIONS: scDC is a novel statistical method for performing differential cell-type composition analysis for scRNA-seq data. It uses bootstrap resampling to estimate the standard errors associated with cell-type proportion estimates and performs significance testing through GLM and GLMM models. We have made this method available to the scientific community as part of the scdney package (Single Cell Data Integrative Analysis) R package, available from https://github.com/SydneyBioX/scdney.
format Online
Article
Text
id pubmed-6929335
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69293352019-12-30 scDC: single cell differential composition analysis Cao, Yue Lin, Yingxin Ormerod, John T. Yang, Pengyi Yang, Jean Y.H. Lo, Kitty K. BMC Bioinformatics Research BACKGROUND: Differences in cell-type composition across subjects and conditions often carry biological significance. Recent advancements in single cell sequencing technologies enable cell-types to be identified at the single cell level, and as a result, cell-type composition of tissues can now be studied in exquisite detail. However, a number of challenges remain with cell-type composition analysis – none of the existing methods can identify cell-type perfectly and variability related to cell sampling exists in any single cell experiment. This necessitates the development of method for estimating uncertainty in cell-type composition. RESULTS: We developed a novel single cell differential composition (scDC) analysis method that performs differential cell-type composition analysis via bootstrap resampling. scDC captures the uncertainty associated with cell-type proportions of each subject via bias-corrected and accelerated bootstrap confidence intervals. We assessed the performance of our method using a number of simulated datasets and synthetic datasets curated from publicly available single cell datasets. In simulated datasets, scDC correctly recovered the true cell-type proportions. In synthetic datasets, the cell-type compositions returned by scDC were highly concordant with reference cell-type compositions from the original data. Since the majority of datasets tested in this study have only 2 to 5 subjects per condition, the addition of confidence intervals enabled better comparisons of compositional differences between subjects and across conditions. CONCLUSIONS: scDC is a novel statistical method for performing differential cell-type composition analysis for scRNA-seq data. It uses bootstrap resampling to estimate the standard errors associated with cell-type proportion estimates and performs significance testing through GLM and GLMM models. We have made this method available to the scientific community as part of the scdney package (Single Cell Data Integrative Analysis) R package, available from https://github.com/SydneyBioX/scdney. BioMed Central 2019-12-24 /pmc/articles/PMC6929335/ /pubmed/31870280 http://dx.doi.org/10.1186/s12859-019-3211-9 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Cao, Yue
Lin, Yingxin
Ormerod, John T.
Yang, Pengyi
Yang, Jean Y.H.
Lo, Kitty K.
scDC: single cell differential composition analysis
title scDC: single cell differential composition analysis
title_full scDC: single cell differential composition analysis
title_fullStr scDC: single cell differential composition analysis
title_full_unstemmed scDC: single cell differential composition analysis
title_short scDC: single cell differential composition analysis
title_sort scdc: single cell differential composition analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929335/
https://www.ncbi.nlm.nih.gov/pubmed/31870280
http://dx.doi.org/10.1186/s12859-019-3211-9
work_keys_str_mv AT caoyue scdcsinglecelldifferentialcompositionanalysis
AT linyingxin scdcsinglecelldifferentialcompositionanalysis
AT ormerodjohnt scdcsinglecelldifferentialcompositionanalysis
AT yangpengyi scdcsinglecelldifferentialcompositionanalysis
AT yangjeanyh scdcsinglecelldifferentialcompositionanalysis
AT lokittyk scdcsinglecelldifferentialcompositionanalysis