Cargando…
Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data
BACKGROUND: Many functional analysis tools have been developed to extract functional and mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA sequencing (scRNA-seq), it is in principle possible to do such an analysis for single cells. However, scRNA-seq data has chara...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7017576/ https://www.ncbi.nlm.nih.gov/pubmed/32051003 http://dx.doi.org/10.1186/s13059-020-1949-z |
_version_ | 1783497224417705984 |
---|---|
author | Holland, Christian H. Tanevski, Jovan Perales-Patón, Javier Gleixner, Jan Kumar, Manu P. Mereu, Elisabetta Joughin, Brian A. Stegle, Oliver Lauffenburger, Douglas A. Heyn, Holger Szalai, Bence Saez-Rodriguez, Julio |
author_facet | Holland, Christian H. Tanevski, Jovan Perales-Patón, Javier Gleixner, Jan Kumar, Manu P. Mereu, Elisabetta Joughin, Brian A. Stegle, Oliver Lauffenburger, Douglas A. Heyn, Holger Szalai, Bence Saez-Rodriguez, Julio |
author_sort | Holland, Christian H. |
collection | PubMed |
description | BACKGROUND: Many functional analysis tools have been developed to extract functional and mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA sequencing (scRNA-seq), it is in principle possible to do such an analysis for single cells. However, scRNA-seq data has characteristics such as drop-out events and low library sizes. It is thus not clear if functional TF and pathway analysis tools established for bulk sequencing can be applied to scRNA-seq in a meaningful way. RESULTS: To address this question, we perform benchmark studies on simulated and real scRNA-seq data. We include the bulk-RNA tools PROGENy, GO enrichment, and DoRothEA that estimate pathway and transcription factor (TF) activities, respectively, and compare them against the tools SCENIC/AUCell and metaVIPER, designed for scRNA-seq. For the in silico study, we simulate single cells from TF/pathway perturbation bulk RNA-seq experiments. We complement the simulated data with real scRNA-seq data upon CRISPR-mediated knock-out. Our benchmarks on simulated and real data reveal comparable performance to the original bulk data. Additionally, we show that the TF and pathway activities preserve cell type-specific variability by analyzing a mixture sample sequenced with 13 scRNA-seq protocols. We also provide the benchmark data for further use by the community. CONCLUSIONS: Our analyses suggest that bulk-based functional analysis tools that use manually curated footprint gene sets can be applied to scRNA-seq data, partially outperforming dedicated single-cell tools. Furthermore, we find that the performance of functional analysis tools is more sensitive to the gene sets than to the statistic used. |
format | Online Article Text |
id | pubmed-7017576 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-70175762020-02-20 Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data Holland, Christian H. Tanevski, Jovan Perales-Patón, Javier Gleixner, Jan Kumar, Manu P. Mereu, Elisabetta Joughin, Brian A. Stegle, Oliver Lauffenburger, Douglas A. Heyn, Holger Szalai, Bence Saez-Rodriguez, Julio Genome Biol Research BACKGROUND: Many functional analysis tools have been developed to extract functional and mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA sequencing (scRNA-seq), it is in principle possible to do such an analysis for single cells. However, scRNA-seq data has characteristics such as drop-out events and low library sizes. It is thus not clear if functional TF and pathway analysis tools established for bulk sequencing can be applied to scRNA-seq in a meaningful way. RESULTS: To address this question, we perform benchmark studies on simulated and real scRNA-seq data. We include the bulk-RNA tools PROGENy, GO enrichment, and DoRothEA that estimate pathway and transcription factor (TF) activities, respectively, and compare them against the tools SCENIC/AUCell and metaVIPER, designed for scRNA-seq. For the in silico study, we simulate single cells from TF/pathway perturbation bulk RNA-seq experiments. We complement the simulated data with real scRNA-seq data upon CRISPR-mediated knock-out. Our benchmarks on simulated and real data reveal comparable performance to the original bulk data. Additionally, we show that the TF and pathway activities preserve cell type-specific variability by analyzing a mixture sample sequenced with 13 scRNA-seq protocols. We also provide the benchmark data for further use by the community. CONCLUSIONS: Our analyses suggest that bulk-based functional analysis tools that use manually curated footprint gene sets can be applied to scRNA-seq data, partially outperforming dedicated single-cell tools. Furthermore, we find that the performance of functional analysis tools is more sensitive to the gene sets than to the statistic used. BioMed Central 2020-02-12 /pmc/articles/PMC7017576/ /pubmed/32051003 http://dx.doi.org/10.1186/s13059-020-1949-z Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Holland, Christian H. Tanevski, Jovan Perales-Patón, Javier Gleixner, Jan Kumar, Manu P. Mereu, Elisabetta Joughin, Brian A. Stegle, Oliver Lauffenburger, Douglas A. Heyn, Holger Szalai, Bence Saez-Rodriguez, Julio Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data |
title | Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data |
title_full | Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data |
title_fullStr | Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data |
title_full_unstemmed | Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data |
title_short | Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data |
title_sort | robustness and applicability of transcription factor and pathway analysis tools on single-cell rna-seq data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7017576/ https://www.ncbi.nlm.nih.gov/pubmed/32051003 http://dx.doi.org/10.1186/s13059-020-1949-z |
work_keys_str_mv | AT hollandchristianh robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT tanevskijovan robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT peralespatonjavier robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT gleixnerjan robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT kumarmanup robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT mereuelisabetta robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT joughinbriana robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT stegleoliver robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT lauffenburgerdouglasa robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT heynholger robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT szalaibence robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata AT saezrodriguezjulio robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata |