Cargando…

Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants

The aryl hydrocarbon receptor (AhR) is an inducible transcription factor whose ligands include the potent environmental contaminant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Ligand-activated AhR binds to DNA at dioxin response elements (DREs) containing the core motif 5′-GCGTG-3′. However, AhR bin...

Descripción completa

Detalles Bibliográficos
Autores principales: Filipovic, David, Qi, Wenjie, Kana, Omar, Marri, Daniel, LeCluyse, Edward L, Andersen, Melvin E, Cuddapah, Suresh, Bhattacharya, Sudin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10682972/
https://www.ncbi.nlm.nih.gov/pubmed/37707797
http://dx.doi.org/10.1093/toxsci/kfad094
_version_ 1785151090132516864
author Filipovic, David
Qi, Wenjie
Kana, Omar
Marri, Daniel
LeCluyse, Edward L
Andersen, Melvin E
Cuddapah, Suresh
Bhattacharya, Sudin
author_facet Filipovic, David
Qi, Wenjie
Kana, Omar
Marri, Daniel
LeCluyse, Edward L
Andersen, Melvin E
Cuddapah, Suresh
Bhattacharya, Sudin
author_sort Filipovic, David
collection PubMed
description The aryl hydrocarbon receptor (AhR) is an inducible transcription factor whose ligands include the potent environmental contaminant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Ligand-activated AhR binds to DNA at dioxin response elements (DREs) containing the core motif 5′-GCGTG-3′. However, AhR binding is highly tissue specific. Most DREs in accessible chromatin are not bound by TCDD-activated AhR, and DREs accessible in multiple tissues can be bound in some and unbound in others. As such, AhR functions similarly to many nuclear receptors. Given that AhR possesses a strong core motif, it is suited for a motif-centered analysis of its binding. We developed interpretable machine learning models predicting the AhR binding status of DREs in MCF-7, GM17212, and HepG2 cells, as well as primary human hepatocytes. Cross-tissue models predicting transcription factor (TF)-DNA binding generally perform poorly. However, reasons for the low performance remain unexplored. By interpreting the results of individual within-tissue models and by examining the features leading to low cross-tissue performance, we identified sequence and chromatin context patterns correlated with AhR binding. We conclude that AhR binding is driven by a complex interplay of tissue-agnostic DRE flanking DNA sequence and tissue-specific local chromatin context. Additionally, we demonstrate that interpretable machine learning models can provide novel and experimentally testable mechanistic insights into DNA binding by inducible TFs.
format Online
Article
Text
id pubmed-10682972
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106829722023-11-30 Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants Filipovic, David Qi, Wenjie Kana, Omar Marri, Daniel LeCluyse, Edward L Andersen, Melvin E Cuddapah, Suresh Bhattacharya, Sudin Toxicol Sci Computational Toxicology and Databases The aryl hydrocarbon receptor (AhR) is an inducible transcription factor whose ligands include the potent environmental contaminant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Ligand-activated AhR binds to DNA at dioxin response elements (DREs) containing the core motif 5′-GCGTG-3′. However, AhR binding is highly tissue specific. Most DREs in accessible chromatin are not bound by TCDD-activated AhR, and DREs accessible in multiple tissues can be bound in some and unbound in others. As such, AhR functions similarly to many nuclear receptors. Given that AhR possesses a strong core motif, it is suited for a motif-centered analysis of its binding. We developed interpretable machine learning models predicting the AhR binding status of DREs in MCF-7, GM17212, and HepG2 cells, as well as primary human hepatocytes. Cross-tissue models predicting transcription factor (TF)-DNA binding generally perform poorly. However, reasons for the low performance remain unexplored. By interpreting the results of individual within-tissue models and by examining the features leading to low cross-tissue performance, we identified sequence and chromatin context patterns correlated with AhR binding. We conclude that AhR binding is driven by a complex interplay of tissue-agnostic DRE flanking DNA sequence and tissue-specific local chromatin context. Additionally, we demonstrate that interpretable machine learning models can provide novel and experimentally testable mechanistic insights into DNA binding by inducible TFs. Oxford University Press 2023-09-14 /pmc/articles/PMC10682972/ /pubmed/37707797 http://dx.doi.org/10.1093/toxsci/kfad094 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the Society of Toxicology. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Toxicology and Databases
Filipovic, David
Qi, Wenjie
Kana, Omar
Marri, Daniel
LeCluyse, Edward L
Andersen, Melvin E
Cuddapah, Suresh
Bhattacharya, Sudin
Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants
title Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants
title_full Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants
title_fullStr Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants
title_full_unstemmed Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants
title_short Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants
title_sort interpretable predictive models of genome-wide aryl hydrocarbon receptor-dna binding reveal tissue-specific binding determinants
topic Computational Toxicology and Databases
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10682972/
https://www.ncbi.nlm.nih.gov/pubmed/37707797
http://dx.doi.org/10.1093/toxsci/kfad094
work_keys_str_mv AT filipovicdavid interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants
AT qiwenjie interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants
AT kanaomar interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants
AT marridaniel interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants
AT lecluyseedwardl interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants
AT andersenmelvine interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants
AT cuddapahsuresh interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants
AT bhattacharyasudin interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants