Cargando…

Integrative prediction of gene expression with chromatin accessibility and conformation data

BACKGROUND: Enhancers play a fundamental role in orchestrating cell state and development. Although several methods have been developed to identify enhancers, linking them to their target genes is still an open problem. Several theories have been proposed on the functional mechanisms of enhancers, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Schmidt, Florian, Kern, Fabian, Schulz, Marcel H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7003490/
https://www.ncbi.nlm.nih.gov/pubmed/32029002
http://dx.doi.org/10.1186/s13072-020-0327-0
_version_ 1783494544508059648
author Schmidt, Florian
Kern, Fabian
Schulz, Marcel H.
author_facet Schmidt, Florian
Kern, Fabian
Schulz, Marcel H.
author_sort Schmidt, Florian
collection PubMed
description BACKGROUND: Enhancers play a fundamental role in orchestrating cell state and development. Although several methods have been developed to identify enhancers, linking them to their target genes is still an open problem. Several theories have been proposed on the functional mechanisms of enhancers, which triggered the development of various methods to infer promoter–enhancer interactions (PEIs). The advancement of high-throughput techniques describing the three-dimensional organization of the chromatin, paved the way to pinpoint long-range PEIs. Here we investigated whether including PEIs in computational models for the prediction of gene expression improves performance and interpretability. RESULTS: We have extended our [Formula: see text] framework to include DNA contacts deduced from chromatin conformation capture experiments and compared various methods to determine PEIs using predictive modelling of gene expression from chromatin accessibility data and predicted transcription factor (TF) motif data. We designed a novel machine learning approach that allows the prioritization of TFs binding to distal loop and promoter regions with respect to their importance for gene expression regulation. Our analysis revealed a set of core TFs that are part of enhancer–promoter loops involving YY1 in different cell lines. CONCLUSION: We present a novel approach that can be used to prioritize TFs involved in distal and promoter-proximal regulatory events by integrating chromatin accessibility, conformation, and gene expression data. We show that the integration of chromatin conformation data can improve gene expression prediction and aids model interpretability.
format Online
Article
Text
id pubmed-7003490
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70034902020-02-10 Integrative prediction of gene expression with chromatin accessibility and conformation data Schmidt, Florian Kern, Fabian Schulz, Marcel H. Epigenetics Chromatin Research BACKGROUND: Enhancers play a fundamental role in orchestrating cell state and development. Although several methods have been developed to identify enhancers, linking them to their target genes is still an open problem. Several theories have been proposed on the functional mechanisms of enhancers, which triggered the development of various methods to infer promoter–enhancer interactions (PEIs). The advancement of high-throughput techniques describing the three-dimensional organization of the chromatin, paved the way to pinpoint long-range PEIs. Here we investigated whether including PEIs in computational models for the prediction of gene expression improves performance and interpretability. RESULTS: We have extended our [Formula: see text] framework to include DNA contacts deduced from chromatin conformation capture experiments and compared various methods to determine PEIs using predictive modelling of gene expression from chromatin accessibility data and predicted transcription factor (TF) motif data. We designed a novel machine learning approach that allows the prioritization of TFs binding to distal loop and promoter regions with respect to their importance for gene expression regulation. Our analysis revealed a set of core TFs that are part of enhancer–promoter loops involving YY1 in different cell lines. CONCLUSION: We present a novel approach that can be used to prioritize TFs involved in distal and promoter-proximal regulatory events by integrating chromatin accessibility, conformation, and gene expression data. We show that the integration of chromatin conformation data can improve gene expression prediction and aids model interpretability. BioMed Central 2020-02-06 /pmc/articles/PMC7003490/ /pubmed/32029002 http://dx.doi.org/10.1186/s13072-020-0327-0 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Schmidt, Florian
Kern, Fabian
Schulz, Marcel H.
Integrative prediction of gene expression with chromatin accessibility and conformation data
title Integrative prediction of gene expression with chromatin accessibility and conformation data
title_full Integrative prediction of gene expression with chromatin accessibility and conformation data
title_fullStr Integrative prediction of gene expression with chromatin accessibility and conformation data
title_full_unstemmed Integrative prediction of gene expression with chromatin accessibility and conformation data
title_short Integrative prediction of gene expression with chromatin accessibility and conformation data
title_sort integrative prediction of gene expression with chromatin accessibility and conformation data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7003490/
https://www.ncbi.nlm.nih.gov/pubmed/32029002
http://dx.doi.org/10.1186/s13072-020-0327-0
work_keys_str_mv AT schmidtflorian integrativepredictionofgeneexpressionwithchromatinaccessibilityandconformationdata
AT kernfabian integrativepredictionofgeneexpressionwithchromatinaccessibilityandconformationdata
AT schulzmarcelh integrativepredictionofgeneexpressionwithchromatinaccessibilityandconformationdata