Cargando…
DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data
BACKGROUND: Single-cell RNA sequencing (scRNA-seq) unfolds complex transcriptomic datasets into detailed cellular maps. Despite recent success, there is a pressing need for specialized methods tailored towards the functional interpretation of these cellular maps. FINDINGS: Here, we present DrivAER,...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7727875/ https://www.ncbi.nlm.nih.gov/pubmed/33301553 http://dx.doi.org/10.1093/gigascience/giaa122 |
_version_ | 1783621147110146048 |
---|---|
author | Simon, Lukas M Yan, Fangfang Zhao, Zhongming |
author_facet | Simon, Lukas M Yan, Fangfang Zhao, Zhongming |
author_sort | Simon, Lukas M |
collection | PubMed |
description | BACKGROUND: Single-cell RNA sequencing (scRNA-seq) unfolds complex transcriptomic datasets into detailed cellular maps. Despite recent success, there is a pressing need for specialized methods tailored towards the functional interpretation of these cellular maps. FINDINGS: Here, we present DrivAER, a machine learning approach for the identification of driving transcriptional programs using autoencoder-based relevance scores. DrivAER scores annotated gene sets on the basis of their relevance to user-specified outcomes such as pseudotemporal ordering or disease status. DrivAER iteratively evaluates the information content of each gene set with respect to the outcome variable using autoencoders. We benchmark our method using extensive simulation analysis as well as comparison to existing methods for functional interpretation of scRNA-seq data. Furthermore, we demonstrate that DrivAER extracts key pathways and transcription factors that regulate complex biological processes from scRNA-seq data. CONCLUSIONS: By quantifying the relevance of annotated gene sets with respect to specified outcome variables, DrivAER greatly enhances our ability to understand the underlying molecular mechanisms. |
format | Online Article Text |
id | pubmed-7727875 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-77278752020-12-16 DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data Simon, Lukas M Yan, Fangfang Zhao, Zhongming Gigascience Technical Note BACKGROUND: Single-cell RNA sequencing (scRNA-seq) unfolds complex transcriptomic datasets into detailed cellular maps. Despite recent success, there is a pressing need for specialized methods tailored towards the functional interpretation of these cellular maps. FINDINGS: Here, we present DrivAER, a machine learning approach for the identification of driving transcriptional programs using autoencoder-based relevance scores. DrivAER scores annotated gene sets on the basis of their relevance to user-specified outcomes such as pseudotemporal ordering or disease status. DrivAER iteratively evaluates the information content of each gene set with respect to the outcome variable using autoencoders. We benchmark our method using extensive simulation analysis as well as comparison to existing methods for functional interpretation of scRNA-seq data. Furthermore, we demonstrate that DrivAER extracts key pathways and transcription factors that regulate complex biological processes from scRNA-seq data. CONCLUSIONS: By quantifying the relevance of annotated gene sets with respect to specified outcome variables, DrivAER greatly enhances our ability to understand the underlying molecular mechanisms. Oxford University Press 2020-12-10 /pmc/articles/PMC7727875/ /pubmed/33301553 http://dx.doi.org/10.1093/gigascience/giaa122 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of GigaScience. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Simon, Lukas M Yan, Fangfang Zhao, Zhongming DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data |
title | DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data |
title_full | DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data |
title_fullStr | DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data |
title_full_unstemmed | DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data |
title_short | DrivAER: Identification of driving transcriptional programs in single-cell RNA sequencing data |
title_sort | drivaer: identification of driving transcriptional programs in single-cell rna sequencing data |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7727875/ https://www.ncbi.nlm.nih.gov/pubmed/33301553 http://dx.doi.org/10.1093/gigascience/giaa122 |
work_keys_str_mv | AT simonlukasm drivaeridentificationofdrivingtranscriptionalprogramsinsinglecellrnasequencingdata AT yanfangfang drivaeridentificationofdrivingtranscriptionalprogramsinsinglecellrnasequencingdata AT zhaozhongming drivaeridentificationofdrivingtranscriptionalprogramsinsinglecellrnasequencingdata |