Cargando…

Deep learning of the tissue-regulated splicing code

Motivation: Alternative splicing (AS) is a regulated process that directs the generation of different transcripts from single genes. A computational model that can accurately predict splicing patterns based on genomic features and cellular context is highly desirable, both in understanding this wide...

Descripción completa

Detalles Bibliográficos
Autores principales: Leung, Michael K. K., Xiong, Hui Yuan, Lee, Leo J., Frey, Brendan J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058935/
https://www.ncbi.nlm.nih.gov/pubmed/24931975
http://dx.doi.org/10.1093/bioinformatics/btu277
_version_ 1782321189638635520
author Leung, Michael K. K.
Xiong, Hui Yuan
Lee, Leo J.
Frey, Brendan J.
author_facet Leung, Michael K. K.
Xiong, Hui Yuan
Lee, Leo J.
Frey, Brendan J.
author_sort Leung, Michael K. K.
collection PubMed
description Motivation: Alternative splicing (AS) is a regulated process that directs the generation of different transcripts from single genes. A computational model that can accurately predict splicing patterns based on genomic features and cellular context is highly desirable, both in understanding this widespread phenomenon, and in exploring the effects of genetic variations on AS. Methods: Using a deep neural network, we developed a model inferred from mouse RNA-Seq data that can predict splicing patterns in individual tissues and differences in splicing patterns across tissues. Our architecture uses hidden variables that jointly represent features in genomic sequences and tissue types when making predictions. A graphics processing unit was used to greatly reduce the training time of our models with millions of parameters. Results: We show that the deep architecture surpasses the performance of the previous Bayesian method for predicting AS patterns. With the proper optimization procedure and selection of hyperparameters, we demonstrate that deep architectures can be beneficial, even with a moderately sparse dataset. An analysis of what the model has learned in terms of the genomic features is presented. Contact: frey@psi.toronto.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4058935
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40589352014-06-18 Deep learning of the tissue-regulated splicing code Leung, Michael K. K. Xiong, Hui Yuan Lee, Leo J. Frey, Brendan J. Bioinformatics Ismb 2014 Proceedings Papers Committee Motivation: Alternative splicing (AS) is a regulated process that directs the generation of different transcripts from single genes. A computational model that can accurately predict splicing patterns based on genomic features and cellular context is highly desirable, both in understanding this widespread phenomenon, and in exploring the effects of genetic variations on AS. Methods: Using a deep neural network, we developed a model inferred from mouse RNA-Seq data that can predict splicing patterns in individual tissues and differences in splicing patterns across tissues. Our architecture uses hidden variables that jointly represent features in genomic sequences and tissue types when making predictions. A graphics processing unit was used to greatly reduce the training time of our models with millions of parameters. Results: We show that the deep architecture surpasses the performance of the previous Bayesian method for predicting AS patterns. With the proper optimization procedure and selection of hyperparameters, we demonstrate that deep architectures can be beneficial, even with a moderately sparse dataset. An analysis of what the model has learned in terms of the genomic features is presented. Contact: frey@psi.toronto.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-06-15 2014-06-11 /pmc/articles/PMC4058935/ /pubmed/24931975 http://dx.doi.org/10.1093/bioinformatics/btu277 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2014 Proceedings Papers Committee
Leung, Michael K. K.
Xiong, Hui Yuan
Lee, Leo J.
Frey, Brendan J.
Deep learning of the tissue-regulated splicing code
title Deep learning of the tissue-regulated splicing code
title_full Deep learning of the tissue-regulated splicing code
title_fullStr Deep learning of the tissue-regulated splicing code
title_full_unstemmed Deep learning of the tissue-regulated splicing code
title_short Deep learning of the tissue-regulated splicing code
title_sort deep learning of the tissue-regulated splicing code
topic Ismb 2014 Proceedings Papers Committee
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058935/
https://www.ncbi.nlm.nih.gov/pubmed/24931975
http://dx.doi.org/10.1093/bioinformatics/btu277
work_keys_str_mv AT leungmichaelkk deeplearningofthetissueregulatedsplicingcode
AT xionghuiyuan deeplearningofthetissueregulatedsplicingcode
AT leeleoj deeplearningofthetissueregulatedsplicingcode
AT freybrendanj deeplearningofthetissueregulatedsplicingcode