Cargando…

Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure

Understanding the genetic regulatory code governing gene expression is an important challenge in molecular biology. However, how individual coding and non-coding regions of the gene regulatory structure interact and contribute to mRNA expression levels remains unclear. Here we apply deep learning on...

Descripción completa

Detalles Bibliográficos
Autores principales: Zrimec, Jan, Börlin, Christoph S., Buric, Filip, Muhammad, Azam Sheikh, Chen, Rhongzen, Siewers, Verena, Verendel, Vilhelm, Nielsen, Jens, Töpel, Mats, Zelezniak, Aleksej
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7708451/
https://www.ncbi.nlm.nih.gov/pubmed/33262328
http://dx.doi.org/10.1038/s41467-020-19921-4
_version_ 1783617549162774528
author Zrimec, Jan
Börlin, Christoph S.
Buric, Filip
Muhammad, Azam Sheikh
Chen, Rhongzen
Siewers, Verena
Verendel, Vilhelm
Nielsen, Jens
Töpel, Mats
Zelezniak, Aleksej
author_facet Zrimec, Jan
Börlin, Christoph S.
Buric, Filip
Muhammad, Azam Sheikh
Chen, Rhongzen
Siewers, Verena
Verendel, Vilhelm
Nielsen, Jens
Töpel, Mats
Zelezniak, Aleksej
author_sort Zrimec, Jan
collection PubMed
description Understanding the genetic regulatory code governing gene expression is an important challenge in molecular biology. However, how individual coding and non-coding regions of the gene regulatory structure interact and contribute to mRNA expression levels remains unclear. Here we apply deep learning on over 20,000 mRNA datasets to examine the genetic regulatory code controlling mRNA abundance in 7 model organisms ranging from bacteria to Human. In all organisms, we can predict mRNA abundance directly from DNA sequence, with up to 82% of the variation of transcript levels encoded in the gene regulatory structure. By searching for DNA regulatory motifs across the gene regulatory structure, we discover that motif interactions could explain the whole dynamic range of mRNA levels. Co-evolution across coding and non-coding regions suggests that it is not single motifs or regions, but the entire gene regulatory structure and specific combination of regulatory elements that define gene expression levels.
format Online
Article
Text
id pubmed-7708451
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-77084512020-12-03 Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure Zrimec, Jan Börlin, Christoph S. Buric, Filip Muhammad, Azam Sheikh Chen, Rhongzen Siewers, Verena Verendel, Vilhelm Nielsen, Jens Töpel, Mats Zelezniak, Aleksej Nat Commun Article Understanding the genetic regulatory code governing gene expression is an important challenge in molecular biology. However, how individual coding and non-coding regions of the gene regulatory structure interact and contribute to mRNA expression levels remains unclear. Here we apply deep learning on over 20,000 mRNA datasets to examine the genetic regulatory code controlling mRNA abundance in 7 model organisms ranging from bacteria to Human. In all organisms, we can predict mRNA abundance directly from DNA sequence, with up to 82% of the variation of transcript levels encoded in the gene regulatory structure. By searching for DNA regulatory motifs across the gene regulatory structure, we discover that motif interactions could explain the whole dynamic range of mRNA levels. Co-evolution across coding and non-coding regions suggests that it is not single motifs or regions, but the entire gene regulatory structure and specific combination of regulatory elements that define gene expression levels. Nature Publishing Group UK 2020-12-01 /pmc/articles/PMC7708451/ /pubmed/33262328 http://dx.doi.org/10.1038/s41467-020-19921-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Zrimec, Jan
Börlin, Christoph S.
Buric, Filip
Muhammad, Azam Sheikh
Chen, Rhongzen
Siewers, Verena
Verendel, Vilhelm
Nielsen, Jens
Töpel, Mats
Zelezniak, Aleksej
Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure
title Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure
title_full Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure
title_fullStr Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure
title_full_unstemmed Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure
title_short Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure
title_sort deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7708451/
https://www.ncbi.nlm.nih.gov/pubmed/33262328
http://dx.doi.org/10.1038/s41467-020-19921-4
work_keys_str_mv AT zrimecjan deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure
AT borlinchristophs deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure
AT buricfilip deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure
AT muhammadazamsheikh deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure
AT chenrhongzen deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure
AT siewersverena deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure
AT verendelvilhelm deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure
AT nielsenjens deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure
AT topelmats deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure
AT zelezniakaleksej deeplearningsuggeststhatgeneexpressionisencodedinallpartsofacoevolvinginteractinggeneregulatorystructure