Cargando…

DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data

BACKGROUND: Single-cell RNA sequencing (scRNA-seq) unfolds complex transcriptomic datasets into detailed cellular maps. Despite recent success, there is a pressing need for specialized methods tailored towards the functional interpretation of these cellular maps. FINDINGS: Here, we present DrivAER,...

Descripción completa

Detalles Bibliográficos
Autores principales: Simon, Lukas M, Yan, Fangfang, Zhao, Zhongming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7727875/
https://www.ncbi.nlm.nih.gov/pubmed/33301553
http://dx.doi.org/10.1093/gigascience/giaa122
_version_ 1783621147110146048
author Simon, Lukas M
Yan, Fangfang
Zhao, Zhongming
author_facet Simon, Lukas M
Yan, Fangfang
Zhao, Zhongming
author_sort Simon, Lukas M
collection PubMed
description BACKGROUND: Single-cell RNA sequencing (scRNA-seq) unfolds complex transcriptomic datasets into detailed cellular maps. Despite recent success, there is a pressing need for specialized methods tailored towards the functional interpretation of these cellular maps. FINDINGS: Here, we present DrivAER, a machine learning approach for the identification of driving transcriptional programs using autoencoder-based relevance scores. DrivAER scores annotated gene sets on the basis of their relevance to user-specified outcomes such as pseudotemporal ordering or disease status. DrivAER iteratively evaluates the information content of each gene set with respect to the outcome variable using autoencoders. We benchmark our method using extensive simulation analysis as well as comparison to existing methods for functional interpretation of scRNA-seq data. Furthermore, we demonstrate that DrivAER extracts key pathways and transcription factors that regulate complex biological processes from scRNA-seq data. CONCLUSIONS: By quantifying the relevance of annotated gene sets with respect to specified outcome variables, DrivAER greatly enhances our ability to understand the underlying molecular mechanisms.
format Online
Article
Text
id pubmed-7727875
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77278752020-12-16 DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data Simon, Lukas M Yan, Fangfang Zhao, Zhongming Gigascience Technical Note BACKGROUND: Single-cell RNA sequencing (scRNA-seq) unfolds complex transcriptomic datasets into detailed cellular maps. Despite recent success, there is a pressing need for specialized methods tailored towards the functional interpretation of these cellular maps. FINDINGS: Here, we present DrivAER, a machine learning approach for the identification of driving transcriptional programs using autoencoder-based relevance scores. DrivAER scores annotated gene sets on the basis of their relevance to user-specified outcomes such as pseudotemporal ordering or disease status. DrivAER iteratively evaluates the information content of each gene set with respect to the outcome variable using autoencoders. We benchmark our method using extensive simulation analysis as well as comparison to existing methods for functional interpretation of scRNA-seq data. Furthermore, we demonstrate that DrivAER extracts key pathways and transcription factors that regulate complex biological processes from scRNA-seq data. CONCLUSIONS: By quantifying the relevance of annotated gene sets with respect to specified outcome variables, DrivAER greatly enhances our ability to understand the underlying molecular mechanisms. Oxford University Press 2020-12-10 /pmc/articles/PMC7727875/ /pubmed/33301553 http://dx.doi.org/10.1093/gigascience/giaa122 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of GigaScience. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Simon, Lukas M
Yan, Fangfang
Zhao, Zhongming
DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data
title DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data
title_full DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data
title_fullStr DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data
title_full_unstemmed DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data
title_short DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data
title_sort drivaer: identification of driving transcriptional programs in single-cell rna sequencing data
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7727875/
https://www.ncbi.nlm.nih.gov/pubmed/33301553
http://dx.doi.org/10.1093/gigascience/giaa122
work_keys_str_mv AT simonlukasm drivaeridentificationofdrivingtranscriptionalprogramsinsinglecellrnasequencingdata
AT yanfangfang drivaeridentificationofdrivingtranscriptionalprogramsinsinglecellrnasequencingdata
AT zhaozhongming drivaeridentificationofdrivingtranscriptionalprogramsinsinglecellrnasequencingdata