Cargando…
An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila
BACKGROUND: Enhancers are non-coding regions of the genome that control the activity of target genes. Recent efforts to identify active enhancers experimentally and in silico have proven effective. While these tools can predict the locations of enhancers with a high degree of accuracy, the mechanism...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574042/ https://www.ncbi.nlm.nih.gov/pubmed/34749786 http://dx.doi.org/10.1186/s13059-021-02532-7 |
_version_ | 1784595540473806848 |
---|---|
author | Wolfe, Jareth C. Mikheeva, Liudmila A. Hagras, Hani Zabet, Nicolae Radu |
author_facet | Wolfe, Jareth C. Mikheeva, Liudmila A. Hagras, Hani Zabet, Nicolae Radu |
author_sort | Wolfe, Jareth C. |
collection | PubMed |
description | BACKGROUND: Enhancers are non-coding regions of the genome that control the activity of target genes. Recent efforts to identify active enhancers experimentally and in silico have proven effective. While these tools can predict the locations of enhancers with a high degree of accuracy, the mechanisms underpinning the activity of enhancers are often unclear. RESULTS: Using machine learning (ML) and a rule-based explainable artificial intelligence (XAI) model, we demonstrate that we can predict the location of known enhancers in Drosophila with a high degree of accuracy. Most importantly, we use the rules of the XAI model to provide insight into the underlying combinatorial histone modifications code of enhancers. In addition, we identified a large set of putative enhancers that display the same epigenetic signature as enhancers identified experimentally. These putative enhancers are enriched in nascent transcription, divergent transcription and have 3D contacts with promoters of transcribed genes. However, they display only intermediary enrichment of mediator and cohesin complexes compared to previously characterised active enhancers. We also found that 10–15% of the predicted enhancers display similar characteristics to super enhancers observed in other species. CONCLUSIONS: Here, we applied an explainable AI model to predict enhancers with high accuracy. Most importantly, we identified that different combinations of epigenetic marks characterise different groups of enhancers. Finally, we discovered a large set of putative enhancers which display similar characteristics with previously characterised active enhancers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02532-7. |
format | Online Article Text |
id | pubmed-8574042 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-85740422021-11-08 An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila Wolfe, Jareth C. Mikheeva, Liudmila A. Hagras, Hani Zabet, Nicolae Radu Genome Biol Research BACKGROUND: Enhancers are non-coding regions of the genome that control the activity of target genes. Recent efforts to identify active enhancers experimentally and in silico have proven effective. While these tools can predict the locations of enhancers with a high degree of accuracy, the mechanisms underpinning the activity of enhancers are often unclear. RESULTS: Using machine learning (ML) and a rule-based explainable artificial intelligence (XAI) model, we demonstrate that we can predict the location of known enhancers in Drosophila with a high degree of accuracy. Most importantly, we use the rules of the XAI model to provide insight into the underlying combinatorial histone modifications code of enhancers. In addition, we identified a large set of putative enhancers that display the same epigenetic signature as enhancers identified experimentally. These putative enhancers are enriched in nascent transcription, divergent transcription and have 3D contacts with promoters of transcribed genes. However, they display only intermediary enrichment of mediator and cohesin complexes compared to previously characterised active enhancers. We also found that 10–15% of the predicted enhancers display similar characteristics to super enhancers observed in other species. CONCLUSIONS: Here, we applied an explainable AI model to predict enhancers with high accuracy. Most importantly, we identified that different combinations of epigenetic marks characterise different groups of enhancers. Finally, we discovered a large set of putative enhancers which display similar characteristics with previously characterised active enhancers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02532-7. BioMed Central 2021-11-08 /pmc/articles/PMC8574042/ /pubmed/34749786 http://dx.doi.org/10.1186/s13059-021-02532-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Wolfe, Jareth C. Mikheeva, Liudmila A. Hagras, Hani Zabet, Nicolae Radu An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila |
title | An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila |
title_full | An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila |
title_fullStr | An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila |
title_full_unstemmed | An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila |
title_short | An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila |
title_sort | explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in drosophila |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574042/ https://www.ncbi.nlm.nih.gov/pubmed/34749786 http://dx.doi.org/10.1186/s13059-021-02532-7 |
work_keys_str_mv | AT wolfejarethc anexplainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila AT mikheevaliudmilaa anexplainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila AT hagrashani anexplainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila AT zabetnicolaeradu anexplainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila AT wolfejarethc explainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila AT mikheevaliudmilaa explainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila AT hagrashani explainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila AT zabetnicolaeradu explainableartificialintelligenceapproachfordecodingtheenhancerhistonemodificationscodeandidentificationofnovelenhancersindrosophila |