Cargando…
Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants
The aryl hydrocarbon receptor (AhR) is an inducible transcription factor whose ligands include the potent environmental contaminant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Ligand-activated AhR binds to DNA at dioxin response elements (DREs) containing the core motif 5′-GCGTG-3′. However, AhR bin...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10682972/ https://www.ncbi.nlm.nih.gov/pubmed/37707797 http://dx.doi.org/10.1093/toxsci/kfad094 |
_version_ | 1785151090132516864 |
---|---|
author | Filipovic, David Qi, Wenjie Kana, Omar Marri, Daniel LeCluyse, Edward L Andersen, Melvin E Cuddapah, Suresh Bhattacharya, Sudin |
author_facet | Filipovic, David Qi, Wenjie Kana, Omar Marri, Daniel LeCluyse, Edward L Andersen, Melvin E Cuddapah, Suresh Bhattacharya, Sudin |
author_sort | Filipovic, David |
collection | PubMed |
description | The aryl hydrocarbon receptor (AhR) is an inducible transcription factor whose ligands include the potent environmental contaminant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Ligand-activated AhR binds to DNA at dioxin response elements (DREs) containing the core motif 5′-GCGTG-3′. However, AhR binding is highly tissue specific. Most DREs in accessible chromatin are not bound by TCDD-activated AhR, and DREs accessible in multiple tissues can be bound in some and unbound in others. As such, AhR functions similarly to many nuclear receptors. Given that AhR possesses a strong core motif, it is suited for a motif-centered analysis of its binding. We developed interpretable machine learning models predicting the AhR binding status of DREs in MCF-7, GM17212, and HepG2 cells, as well as primary human hepatocytes. Cross-tissue models predicting transcription factor (TF)-DNA binding generally perform poorly. However, reasons for the low performance remain unexplored. By interpreting the results of individual within-tissue models and by examining the features leading to low cross-tissue performance, we identified sequence and chromatin context patterns correlated with AhR binding. We conclude that AhR binding is driven by a complex interplay of tissue-agnostic DRE flanking DNA sequence and tissue-specific local chromatin context. Additionally, we demonstrate that interpretable machine learning models can provide novel and experimentally testable mechanistic insights into DNA binding by inducible TFs. |
format | Online Article Text |
id | pubmed-10682972 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106829722023-11-30 Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants Filipovic, David Qi, Wenjie Kana, Omar Marri, Daniel LeCluyse, Edward L Andersen, Melvin E Cuddapah, Suresh Bhattacharya, Sudin Toxicol Sci Computational Toxicology and Databases The aryl hydrocarbon receptor (AhR) is an inducible transcription factor whose ligands include the potent environmental contaminant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Ligand-activated AhR binds to DNA at dioxin response elements (DREs) containing the core motif 5′-GCGTG-3′. However, AhR binding is highly tissue specific. Most DREs in accessible chromatin are not bound by TCDD-activated AhR, and DREs accessible in multiple tissues can be bound in some and unbound in others. As such, AhR functions similarly to many nuclear receptors. Given that AhR possesses a strong core motif, it is suited for a motif-centered analysis of its binding. We developed interpretable machine learning models predicting the AhR binding status of DREs in MCF-7, GM17212, and HepG2 cells, as well as primary human hepatocytes. Cross-tissue models predicting transcription factor (TF)-DNA binding generally perform poorly. However, reasons for the low performance remain unexplored. By interpreting the results of individual within-tissue models and by examining the features leading to low cross-tissue performance, we identified sequence and chromatin context patterns correlated with AhR binding. We conclude that AhR binding is driven by a complex interplay of tissue-agnostic DRE flanking DNA sequence and tissue-specific local chromatin context. Additionally, we demonstrate that interpretable machine learning models can provide novel and experimentally testable mechanistic insights into DNA binding by inducible TFs. Oxford University Press 2023-09-14 /pmc/articles/PMC10682972/ /pubmed/37707797 http://dx.doi.org/10.1093/toxsci/kfad094 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the Society of Toxicology. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Toxicology and Databases Filipovic, David Qi, Wenjie Kana, Omar Marri, Daniel LeCluyse, Edward L Andersen, Melvin E Cuddapah, Suresh Bhattacharya, Sudin Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants |
title | Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants |
title_full | Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants |
title_fullStr | Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants |
title_full_unstemmed | Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants |
title_short | Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants |
title_sort | interpretable predictive models of genome-wide aryl hydrocarbon receptor-dna binding reveal tissue-specific binding determinants |
topic | Computational Toxicology and Databases |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10682972/ https://www.ncbi.nlm.nih.gov/pubmed/37707797 http://dx.doi.org/10.1093/toxsci/kfad094 |
work_keys_str_mv | AT filipovicdavid interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants AT qiwenjie interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants AT kanaomar interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants AT marridaniel interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants AT lecluyseedwardl interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants AT andersenmelvine interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants AT cuddapahsuresh interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants AT bhattacharyasudin interpretablepredictivemodelsofgenomewidearylhydrocarbonreceptordnabindingrevealtissuespecificbindingdeterminants |