Cargando…

Exploring generative deep learning for omics data using log-linear models

MOTIVATION: Following many successful applications to image data, deep learning is now also increasingly considered for omics data. In particular, generative deep learning not only provides competitive prediction performance, but also allows for uncovering structure by generating synthetic samples....

Descripción completa

Detalles Bibliográficos
Autores principales: Hess, Moritz, Hackenberg, Maren, Binder, Harald
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755415/
https://www.ncbi.nlm.nih.gov/pubmed/32647888
http://dx.doi.org/10.1093/bioinformatics/btaa623
_version_ 1783626350877212672
author Hess, Moritz
Hackenberg, Maren
Binder, Harald
author_facet Hess, Moritz
Hackenberg, Maren
Binder, Harald
author_sort Hess, Moritz
collection PubMed
description MOTIVATION: Following many successful applications to image data, deep learning is now also increasingly considered for omics data. In particular, generative deep learning not only provides competitive prediction performance, but also allows for uncovering structure by generating synthetic samples. However, exploration and visualization is not as straightforward as with image applications. RESULTS: We demonstrate how log-linear models, fitted to the generated, synthetic data can be used to extract patterns from omics data, learned by deep generative techniques. Specifically, interactions between latent representations learned by the approaches and generated synthetic data are used to determine sets of joint patterns. Distances of patterns with respect to the distribution of latent representations are then visualized in low-dimensional coordinate systems, e.g. for monitoring training progress. This is illustrated with simulated data and subsequently with cortical single-cell gene expression data. Using different kinds of deep generative techniques, specifically variational autoencoders and deep Boltzmann machines, the proposed approach highlights how the techniques uncover underlying structure. It facilitates the real-world use of such generative deep learning techniques to gain biological insights from omics data. AVAILABILITY AND IMPLEMENTATION: The code for the approach as well as an accompanying Jupyter notebook, which illustrates the application of our approach, is available via the GitHub repository: https://github.com/ssehztirom/Exploring-generative-deep-learning-for-omics-data-by-using-log-linear-models. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7755415
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77554152020-12-29 Exploring generative deep learning for omics data using log-linear models Hess, Moritz Hackenberg, Maren Binder, Harald Bioinformatics Original Papers MOTIVATION: Following many successful applications to image data, deep learning is now also increasingly considered for omics data. In particular, generative deep learning not only provides competitive prediction performance, but also allows for uncovering structure by generating synthetic samples. However, exploration and visualization is not as straightforward as with image applications. RESULTS: We demonstrate how log-linear models, fitted to the generated, synthetic data can be used to extract patterns from omics data, learned by deep generative techniques. Specifically, interactions between latent representations learned by the approaches and generated synthetic data are used to determine sets of joint patterns. Distances of patterns with respect to the distribution of latent representations are then visualized in low-dimensional coordinate systems, e.g. for monitoring training progress. This is illustrated with simulated data and subsequently with cortical single-cell gene expression data. Using different kinds of deep generative techniques, specifically variational autoencoders and deep Boltzmann machines, the proposed approach highlights how the techniques uncover underlying structure. It facilitates the real-world use of such generative deep learning techniques to gain biological insights from omics data. AVAILABILITY AND IMPLEMENTATION: The code for the approach as well as an accompanying Jupyter notebook, which illustrates the application of our approach, is available via the GitHub repository: https://github.com/ssehztirom/Exploring-generative-deep-learning-for-omics-data-by-using-log-linear-models. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-08-01 /pmc/articles/PMC7755415/ /pubmed/32647888 http://dx.doi.org/10.1093/bioinformatics/btaa623 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Hess, Moritz
Hackenberg, Maren
Binder, Harald
Exploring generative deep learning for omics data using log-linear models
title Exploring generative deep learning for omics data using log-linear models
title_full Exploring generative deep learning for omics data using log-linear models
title_fullStr Exploring generative deep learning for omics data using log-linear models
title_full_unstemmed Exploring generative deep learning for omics data using log-linear models
title_short Exploring generative deep learning for omics data using log-linear models
title_sort exploring generative deep learning for omics data using log-linear models
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755415/
https://www.ncbi.nlm.nih.gov/pubmed/32647888
http://dx.doi.org/10.1093/bioinformatics/btaa623
work_keys_str_mv AT hessmoritz exploringgenerativedeeplearningforomicsdatausingloglinearmodels
AT hackenbergmaren exploringgenerativedeeplearningforomicsdatausingloglinearmodels
AT binderharald exploringgenerativedeeplearningforomicsdatausingloglinearmodels