Cargando…

Shallow Sparsely-Connected Autoencoders for Gene Set Projection

When analyzing biological data, it can be helpful to consider gene sets, or predefined groups of biologically related genes. Methods exist for identifying gene sets that are differential between conditions, but large public datasets from consortium projects and single-cell RNA-Sequencing have opened...

Descripción completa

Detalles Bibliográficos
Autores principales: Gold, Maxwell P., LeNail, Alexander, Fraenkel, Ernest
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417803/
https://www.ncbi.nlm.nih.gov/pubmed/30963076
_version_ 1783403626074472448
author Gold, Maxwell P.
LeNail, Alexander
Fraenkel, Ernest
author_facet Gold, Maxwell P.
LeNail, Alexander
Fraenkel, Ernest
author_sort Gold, Maxwell P.
collection PubMed
description When analyzing biological data, it can be helpful to consider gene sets, or predefined groups of biologically related genes. Methods exist for identifying gene sets that are differential between conditions, but large public datasets from consortium projects and single-cell RNA-Sequencing have opened the door for gene set analysis using more sophisticated machine learning techniques, such as autoencoders and variational autoencoders. We present shallow sparsely-connected autoencoders (SSCAs) and variational autoencoders (SSCVAs) as tools for projecting gene-level data onto gene sets. We tested these approaches on single-cell RNA-Sequencing data from blood cells and on RNA-Sequencing data from breast cancer patients. Both SSCA and SSCVA can recover known biological features from these datasets and the SSCVA method often outperforms SSCA (and six existing gene set scoring algorithms) on classification and prediction tasks.
format Online
Article
Text
id pubmed-6417803
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-64178032019-03-14 Shallow Sparsely-Connected Autoencoders for Gene Set Projection Gold, Maxwell P. LeNail, Alexander Fraenkel, Ernest Pac Symp Biocomput Article When analyzing biological data, it can be helpful to consider gene sets, or predefined groups of biologically related genes. Methods exist for identifying gene sets that are differential between conditions, but large public datasets from consortium projects and single-cell RNA-Sequencing have opened the door for gene set analysis using more sophisticated machine learning techniques, such as autoencoders and variational autoencoders. We present shallow sparsely-connected autoencoders (SSCAs) and variational autoencoders (SSCVAs) as tools for projecting gene-level data onto gene sets. We tested these approaches on single-cell RNA-Sequencing data from blood cells and on RNA-Sequencing data from breast cancer patients. Both SSCA and SSCVA can recover known biological features from these datasets and the SSCVA method often outperforms SSCA (and six existing gene set scoring algorithms) on classification and prediction tasks. 2019 /pmc/articles/PMC6417803/ /pubmed/30963076 Text en https://creativecommons.org/licenses/by-nc/4.0/Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License.
spellingShingle Article
Gold, Maxwell P.
LeNail, Alexander
Fraenkel, Ernest
Shallow Sparsely-Connected Autoencoders for Gene Set Projection
title Shallow Sparsely-Connected Autoencoders for Gene Set Projection
title_full Shallow Sparsely-Connected Autoencoders for Gene Set Projection
title_fullStr Shallow Sparsely-Connected Autoencoders for Gene Set Projection
title_full_unstemmed Shallow Sparsely-Connected Autoencoders for Gene Set Projection
title_short Shallow Sparsely-Connected Autoencoders for Gene Set Projection
title_sort shallow sparsely-connected autoencoders for gene set projection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417803/
https://www.ncbi.nlm.nih.gov/pubmed/30963076
work_keys_str_mv AT goldmaxwellp shallowsparselyconnectedautoencodersforgenesetprojection
AT lenailalexander shallowsparselyconnectedautoencodersforgenesetprojection
AT fraenkelernest shallowsparselyconnectedautoencodersforgenesetprojection