Cargando…

An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila

BACKGROUND: Enhancers are non-coding regions of the genome that control the activity of target genes. Recent efforts to identify active enhancers experimentally and in silico have proven effective. While these tools can predict the locations of enhancers with a high degree of accuracy, the mechanism...

Descripción completa

Detalles Bibliográficos
Autores principales: Wolfe, Jareth C., Mikheeva, Liudmila A., Hagras, Hani, Zabet, Nicolae Radu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574042/
https://www.ncbi.nlm.nih.gov/pubmed/34749786
http://dx.doi.org/10.1186/s13059-021-02532-7
_version_ 1784595540473806848
author Wolfe, Jareth C.
Mikheeva, Liudmila A.
Hagras, Hani
Zabet, Nicolae Radu
author_facet Wolfe, Jareth C.
Mikheeva, Liudmila A.
Hagras, Hani
Zabet, Nicolae Radu
author_sort Wolfe, Jareth C.
collection PubMed
description BACKGROUND: Enhancers are non-coding regions of the genome that control the activity of target genes. Recent efforts to identify active enhancers experimentally and in silico have proven effective. While these tools can predict the locations of enhancers with a high degree of accuracy, the mechanisms underpinning the activity of enhancers are often unclear. RESULTS: Using machine learning (ML) and a rule-based explainable artificial intelligence (XAI) model, we demonstrate that we can predict the location of known enhancers in Drosophila with a high degree of accuracy. Most importantly, we use the rules of the XAI model to provide insight into the underlying combinatorial histone modifications code of enhancers. In addition, we identified a large set of putative enhancers that display the same epigenetic signature as enhancers identified experimentally. These putative enhancers are enriched in nascent transcription, divergent transcription and have 3D contacts with promoters of transcribed genes. However, they display only intermediary enrichment of mediator and cohesin complexes compared to previously characterised active enhancers. We also found that 10–15% of the predicted enhancers display similar characteristics to super enhancers observed in other species. CONCLUSIONS: Here, we applied an explainable AI model to predict enhancers with high accuracy. Most importantly, we identified that different combinations of epigenetic marks characterise different groups of enhancers. Finally, we discovered a large set of putative enhancers which display similar characteristics with previously characterised active enhancers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02532-7.
format Online
Article
Text
id pubmed-8574042
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-85740422021-11-08 An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila Wolfe, Jareth C. Mikheeva, Liudmila A. Hagras, Hani Zabet, Nicolae Radu Genome Biol Research BACKGROUND: Enhancers are non-coding regions of the genome that control the activity of target genes. Recent efforts to identify active enhancers experimentally and in silico have proven effective. While these tools can predict the locations of enhancers with a high degree of accuracy, the mechanisms underpinning the activity of enhancers are often unclear. RESULTS: Using machine learning (ML) and a rule-based explainable artificial intelligence (XAI) model, we demonstrate that we can predict the location of known enhancers in Drosophila with a high degree of accuracy. Most importantly, we use the rules of the XAI model to provide insight into the underlying combinatorial histone modifications code of enhancers. In addition, we identified a large set of putative enhancers that display the same epigenetic signature as enhancers identified experimentally. These putative enhancers are enriched in nascent transcription, divergent transcription and have 3D contacts with promoters of transcribed genes. However, they display only intermediary enrichment of mediator and cohesin complexes compared to previously characterised active enhancers. We also found that 10–15% of the predicted enhancers display similar characteristics to super enhancers observed in other species. CONCLUSIONS: Here, we applied an explainable AI model to predict enhancers with high accuracy. Most importantly, we identified that different combinations of epigenetic marks characterise different groups of enhancers. Finally, we discovered a large set of putative enhancers which display similar characteristics with previously characterised active enhancers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02532-7. BioMed Central 2021-11-08 /pmc/articles/PMC8574042/ /pubmed/34749786 http://dx.doi.org/10.1186/s13059-021-02532-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Wolfe, Jareth C.
Mikheeva, Liudmila A.
Hagras, Hani
Zabet, Nicolae Radu
An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila
title An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila
title_full An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila
title_fullStr An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila
title_full_unstemmed An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila
title_short An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila
title_sort explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in drosophila
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574042/
https://www.ncbi.nlm.nih.gov/pubmed/34749786
http://dx.doi.org/10.1186/s13059-021-02532-7
work_keys_str_mv AT wolfejarethc anexplainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila
AT mikheevaliudmilaa anexplainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila
AT hagrashani anexplainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila
AT zabetnicolaeradu anexplainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila
AT wolfejarethc explainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila
AT mikheevaliudmilaa explainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila
AT hagrashani explainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila
AT zabetnicolaeradu explainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila