Cargando…
Exploring generative deep learning for omics data using log-linear models
MOTIVATION: Following many successful applications to image data, deep learning is now also increasingly considered for omics data. In particular, generative deep learning not only provides competitive prediction performance, but also allows for uncovering structure by generating synthetic samples....
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755415/ https://www.ncbi.nlm.nih.gov/pubmed/32647888 http://dx.doi.org/10.1093/bioinformatics/btaa623 |
_version_ | 1783626350877212672 |
---|---|
author | Hess, Moritz Hackenberg, Maren Binder, Harald |
author_facet | Hess, Moritz Hackenberg, Maren Binder, Harald |
author_sort | Hess, Moritz |
collection | PubMed |
description | MOTIVATION: Following many successful applications to image data, deep learning is now also increasingly considered for omics data. In particular, generative deep learning not only provides competitive prediction performance, but also allows for uncovering structure by generating synthetic samples. However, exploration and visualization is not as straightforward as with image applications. RESULTS: We demonstrate how log-linear models, fitted to the generated, synthetic data can be used to extract patterns from omics data, learned by deep generative techniques. Specifically, interactions between latent representations learned by the approaches and generated synthetic data are used to determine sets of joint patterns. Distances of patterns with respect to the distribution of latent representations are then visualized in low-dimensional coordinate systems, e.g. for monitoring training progress. This is illustrated with simulated data and subsequently with cortical single-cell gene expression data. Using different kinds of deep generative techniques, specifically variational autoencoders and deep Boltzmann machines, the proposed approach highlights how the techniques uncover underlying structure. It facilitates the real-world use of such generative deep learning techniques to gain biological insights from omics data. AVAILABILITY AND IMPLEMENTATION: The code for the approach as well as an accompanying Jupyter notebook, which illustrates the application of our approach, is available via the GitHub repository: https://github.com/ssehztirom/Exploring-generative-deep-learning-for-omics-data-by-using-log-linear-models. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7755415 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-77554152020-12-29 Exploring generative deep learning for omics data using log-linear models Hess, Moritz Hackenberg, Maren Binder, Harald Bioinformatics Original Papers MOTIVATION: Following many successful applications to image data, deep learning is now also increasingly considered for omics data. In particular, generative deep learning not only provides competitive prediction performance, but also allows for uncovering structure by generating synthetic samples. However, exploration and visualization is not as straightforward as with image applications. RESULTS: We demonstrate how log-linear models, fitted to the generated, synthetic data can be used to extract patterns from omics data, learned by deep generative techniques. Specifically, interactions between latent representations learned by the approaches and generated synthetic data are used to determine sets of joint patterns. Distances of patterns with respect to the distribution of latent representations are then visualized in low-dimensional coordinate systems, e.g. for monitoring training progress. This is illustrated with simulated data and subsequently with cortical single-cell gene expression data. Using different kinds of deep generative techniques, specifically variational autoencoders and deep Boltzmann machines, the proposed approach highlights how the techniques uncover underlying structure. It facilitates the real-world use of such generative deep learning techniques to gain biological insights from omics data. AVAILABILITY AND IMPLEMENTATION: The code for the approach as well as an accompanying Jupyter notebook, which illustrates the application of our approach, is available via the GitHub repository: https://github.com/ssehztirom/Exploring-generative-deep-learning-for-omics-data-by-using-log-linear-models. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-08-01 /pmc/articles/PMC7755415/ /pubmed/32647888 http://dx.doi.org/10.1093/bioinformatics/btaa623 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Hess, Moritz Hackenberg, Maren Binder, Harald Exploring generative deep learning for omics data using log-linear models |
title | Exploring generative deep learning for omics data using log-linear models |
title_full | Exploring generative deep learning for omics data using log-linear models |
title_fullStr | Exploring generative deep learning for omics data using log-linear models |
title_full_unstemmed | Exploring generative deep learning for omics data using log-linear models |
title_short | Exploring generative deep learning for omics data using log-linear models |
title_sort | exploring generative deep learning for omics data using log-linear models |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755415/ https://www.ncbi.nlm.nih.gov/pubmed/32647888 http://dx.doi.org/10.1093/bioinformatics/btaa623 |
work_keys_str_mv | AT hessmoritz exploringgenerativedeeplearningforomicsdatausingloglinearmodels AT hackenbergmaren exploringgenerativedeeplearningforomicsdatausingloglinearmodels AT binderharald exploringgenerativedeeplearningforomicsdatausingloglinearmodels |