Cargando…

Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks

Deciphering the sequence-function relationship encoded in enhancers holds the key to interpreting non-coding variants and understanding mechanisms of transcriptomic variation. Several quantitative models exist for predicting enhancer function and underlying mechanisms; however, there has been no sys...

Descripción completa

Detalles Bibliográficos
Autores principales:	Dibaeinia, Payam, Sinha, Saurabh
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2021
Materias:	Computational Biology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8501998/ https://www.ncbi.nlm.nih.gov/pubmed/34508359 http://dx.doi.org/10.1093/nar/gkab765

_version_	1784580791284531200
author	Dibaeinia, Payam Sinha, Saurabh
author_facet	Dibaeinia, Payam Sinha, Saurabh
author_sort	Dibaeinia, Payam
collection	PubMed
description	Deciphering the sequence-function relationship encoded in enhancers holds the key to interpreting non-coding variants and understanding mechanisms of transcriptomic variation. Several quantitative models exist for predicting enhancer function and underlying mechanisms; however, there has been no systematic comparison of these models characterizing their relative strengths and shortcomings. Here, we interrogated a rich data set of neuroectodermal enhancers in Drosophila, representing cis- and trans- sources of expression variation, with a suite of biophysical and machine learning models. We performed rigorous comparisons of thermodynamics-based models implementing different mechanisms of activation, repression and cooperativity. Moreover, we developed a convolutional neural network (CNN) model, called CoNSEPT, that learns enhancer ‘grammar’ in an unbiased manner. CoNSEPT is the first general-purpose CNN tool for predicting enhancer function in varying conditions, such as different cell types and experimental conditions, and we show that such complex models can suggest interpretable mechanisms. We found model-based evidence for mechanisms previously established for the studied system, including cooperative activation and short-range repression. The data also favored one hypothesized activation mechanism over another and suggested an intriguing role for a direct, distance-independent repression mechanism. Our modeling shows that while fundamentally different models can yield similar fits to data, they vary in their utility for mechanistic inference. CoNSEPT is freely available at: https://github.com/PayamDiba/CoNSEPT.
format	Online Article Text
id	pubmed-8501998
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-85019982021-10-12 Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks Dibaeinia, Payam Sinha, Saurabh Nucleic Acids Res Computational Biology Deciphering the sequence-function relationship encoded in enhancers holds the key to interpreting non-coding variants and understanding mechanisms of transcriptomic variation. Several quantitative models exist for predicting enhancer function and underlying mechanisms; however, there has been no systematic comparison of these models characterizing their relative strengths and shortcomings. Here, we interrogated a rich data set of neuroectodermal enhancers in Drosophila, representing cis- and trans- sources of expression variation, with a suite of biophysical and machine learning models. We performed rigorous comparisons of thermodynamics-based models implementing different mechanisms of activation, repression and cooperativity. Moreover, we developed a convolutional neural network (CNN) model, called CoNSEPT, that learns enhancer ‘grammar’ in an unbiased manner. CoNSEPT is the first general-purpose CNN tool for predicting enhancer function in varying conditions, such as different cell types and experimental conditions, and we show that such complex models can suggest interpretable mechanisms. We found model-based evidence for mechanisms previously established for the studied system, including cooperative activation and short-range repression. The data also favored one hypothesized activation mechanism over another and suggested an intriguing role for a direct, distance-independent repression mechanism. Our modeling shows that while fundamentally different models can yield similar fits to data, they vary in their utility for mechanistic inference. CoNSEPT is freely available at: https://github.com/PayamDiba/CoNSEPT. Oxford University Press 2021-09-11 /pmc/articles/PMC8501998/ /pubmed/34508359 http://dx.doi.org/10.1093/nar/gkab765 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Computational Biology Dibaeinia, Payam Sinha, Saurabh Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks
title	Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks
title_full	Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks
title_fullStr	Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks
title_full_unstemmed	Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks
title_short	Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks
title_sort	deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks
topic	Computational Biology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8501998/ https://www.ncbi.nlm.nih.gov/pubmed/34508359 http://dx.doi.org/10.1093/nar/gkab765
work_keys_str_mv	AT dibaeiniapayam decipheringenhancersequenceusingthermodynamicsbasedmodelsandconvolutionalneuralnetworks AT sinhasaurabh decipheringenhancersequenceusingthermodynamicsbasedmodelsandconvolutionalneuralnetworks

Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks

Ejemplares similares