Cargando…

Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data

BACKGROUND: Many functional analysis tools have been developed to extract functional and mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA sequencing (scRNA-seq), it is in principle possible to do such an analysis for single cells. However, scRNA-seq data has chara...

Descripción completa

Detalles Bibliográficos
Autores principales: Holland, Christian H., Tanevski, Jovan, Perales-Patón, Javier, Gleixner, Jan, Kumar, Manu P., Mereu, Elisabetta, Joughin, Brian A., Stegle, Oliver, Lauffenburger, Douglas A., Heyn, Holger, Szalai, Bence, Saez-Rodriguez, Julio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7017576/
https://www.ncbi.nlm.nih.gov/pubmed/32051003
http://dx.doi.org/10.1186/s13059-020-1949-z
_version_ 1783497224417705984
author Holland, Christian H.
Tanevski, Jovan
Perales-Patón, Javier
Gleixner, Jan
Kumar, Manu P.
Mereu, Elisabetta
Joughin, Brian A.
Stegle, Oliver
Lauffenburger, Douglas A.
Heyn, Holger
Szalai, Bence
Saez-Rodriguez, Julio
author_facet Holland, Christian H.
Tanevski, Jovan
Perales-Patón, Javier
Gleixner, Jan
Kumar, Manu P.
Mereu, Elisabetta
Joughin, Brian A.
Stegle, Oliver
Lauffenburger, Douglas A.
Heyn, Holger
Szalai, Bence
Saez-Rodriguez, Julio
author_sort Holland, Christian H.
collection PubMed
description BACKGROUND: Many functional analysis tools have been developed to extract functional and mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA sequencing (scRNA-seq), it is in principle possible to do such an analysis for single cells. However, scRNA-seq data has characteristics such as drop-out events and low library sizes. It is thus not clear if functional TF and pathway analysis tools established for bulk sequencing can be applied to scRNA-seq in a meaningful way. RESULTS: To address this question, we perform benchmark studies on simulated and real scRNA-seq data. We include the bulk-RNA tools PROGENy, GO enrichment, and DoRothEA that estimate pathway and transcription factor (TF) activities, respectively, and compare them against the tools SCENIC/AUCell and metaVIPER, designed for scRNA-seq. For the in silico study, we simulate single cells from TF/pathway perturbation bulk RNA-seq experiments. We complement the simulated data with real scRNA-seq data upon CRISPR-mediated knock-out. Our benchmarks on simulated and real data reveal comparable performance to the original bulk data. Additionally, we show that the TF and pathway activities preserve cell type-specific variability by analyzing a mixture sample sequenced with 13 scRNA-seq protocols. We also provide the benchmark data for further use by the community. CONCLUSIONS: Our analyses suggest that bulk-based functional analysis tools that use manually curated footprint gene sets can be applied to scRNA-seq data, partially outperforming dedicated single-cell tools. Furthermore, we find that the performance of functional analysis tools is more sensitive to the gene sets than to the statistic used.
format Online
Article
Text
id pubmed-7017576
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70175762020-02-20 Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data Holland, Christian H. Tanevski, Jovan Perales-Patón, Javier Gleixner, Jan Kumar, Manu P. Mereu, Elisabetta Joughin, Brian A. Stegle, Oliver Lauffenburger, Douglas A. Heyn, Holger Szalai, Bence Saez-Rodriguez, Julio Genome Biol Research BACKGROUND: Many functional analysis tools have been developed to extract functional and mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA sequencing (scRNA-seq), it is in principle possible to do such an analysis for single cells. However, scRNA-seq data has characteristics such as drop-out events and low library sizes. It is thus not clear if functional TF and pathway analysis tools established for bulk sequencing can be applied to scRNA-seq in a meaningful way. RESULTS: To address this question, we perform benchmark studies on simulated and real scRNA-seq data. We include the bulk-RNA tools PROGENy, GO enrichment, and DoRothEA that estimate pathway and transcription factor (TF) activities, respectively, and compare them against the tools SCENIC/AUCell and metaVIPER, designed for scRNA-seq. For the in silico study, we simulate single cells from TF/pathway perturbation bulk RNA-seq experiments. We complement the simulated data with real scRNA-seq data upon CRISPR-mediated knock-out. Our benchmarks on simulated and real data reveal comparable performance to the original bulk data. Additionally, we show that the TF and pathway activities preserve cell type-specific variability by analyzing a mixture sample sequenced with 13 scRNA-seq protocols. We also provide the benchmark data for further use by the community. CONCLUSIONS: Our analyses suggest that bulk-based functional analysis tools that use manually curated footprint gene sets can be applied to scRNA-seq data, partially outperforming dedicated single-cell tools. Furthermore, we find that the performance of functional analysis tools is more sensitive to the gene sets than to the statistic used. BioMed Central 2020-02-12 /pmc/articles/PMC7017576/ /pubmed/32051003 http://dx.doi.org/10.1186/s13059-020-1949-z Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Holland, Christian H.
Tanevski, Jovan
Perales-Patón, Javier
Gleixner, Jan
Kumar, Manu P.
Mereu, Elisabetta
Joughin, Brian A.
Stegle, Oliver
Lauffenburger, Douglas A.
Heyn, Holger
Szalai, Bence
Saez-Rodriguez, Julio
Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data
title Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data
title_full Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data
title_fullStr Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data
title_full_unstemmed Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data
title_short Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data
title_sort robustness and applicability of transcription factor and pathway analysis tools on single-cell rna-seq data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7017576/
https://www.ncbi.nlm.nih.gov/pubmed/32051003
http://dx.doi.org/10.1186/s13059-020-1949-z
work_keys_str_mv AT hollandchristianh robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT tanevskijovan robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT peralespatonjavier robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT gleixnerjan robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT kumarmanup robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT mereuelisabetta robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT joughinbriana robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT stegleoliver robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT lauffenburgerdouglasa robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT heynholger robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT szalaibence robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata
AT saezrodriguezjulio robustnessandapplicabilityoftranscriptionfactorandpathwayanalysistoolsonsinglecellrnaseqdata