Cargando…
Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning
Genomic sequence variation within enhancers and promoters can have a significant impact on the cellular state and phenotype. However, sifting through the millions of candidate variants in a personal genome or a cancer genome, to identify those that impact cis-regulatory function, remains a major cha...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8168584/ https://www.ncbi.nlm.nih.gov/pubmed/33832990 http://dx.doi.org/10.1101/gr.260851.120 |
_version_ | 1783701904364142592 |
---|---|
author | Atak, Zeynep Kalender Taskiran, Ibrahim Ihsan Demeulemeester, Jonas Flerin, Christopher Mauduit, David Minnoye, Liesbeth Hulselmans, Gert Christiaens, Valerie Ghanem, Ghanem-Elias Wouters, Jasper Aerts, Stein |
author_facet | Atak, Zeynep Kalender Taskiran, Ibrahim Ihsan Demeulemeester, Jonas Flerin, Christopher Mauduit, David Minnoye, Liesbeth Hulselmans, Gert Christiaens, Valerie Ghanem, Ghanem-Elias Wouters, Jasper Aerts, Stein |
author_sort | Atak, Zeynep Kalender |
collection | PubMed |
description | Genomic sequence variation within enhancers and promoters can have a significant impact on the cellular state and phenotype. However, sifting through the millions of candidate variants in a personal genome or a cancer genome, to identify those that impact cis-regulatory function, remains a major challenge. Interpretation of noncoding genome variation benefits from explainable artificial intelligence to predict and interpret the impact of a mutation on gene regulation. Here we generate phased whole genomes with matched chromatin accessibility, histone modifications, and gene expression for 10 melanoma cell lines. We find that training a specialized deep learning model, called DeepMEL2, on melanoma chromatin accessibility data can capture the various regulatory programs of the melanocytic and mesenchymal-like melanoma cell states. This model outperforms motif-based variant scoring, as well as more generic deep learning models. We detect hundreds to thousands of allele-specific chromatin accessibility variants (ASCAVs) in each melanoma genome, of which 15%–20% can be explained by gains or losses of transcription factor binding sites. A considerable fraction of ASCAVs are caused by changes in AP-1 binding, as confirmed by matched ChIP-seq data to identify allele-specific binding of JUN and FOSL1. Finally, by augmenting the DeepMEL2 model with ChIP-seq data for GABPA, the TERT promoter mutation, as well as additional ETS motif gains, can be identified with high confidence. In conclusion, we present a new integrative genomics approach and a deep learning model to identify and interpret functional enhancer mutations with allelic imbalance of chromatin accessibility and gene expression. |
format | Online Article Text |
id | pubmed-8168584 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-81685842021-06-14 Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning Atak, Zeynep Kalender Taskiran, Ibrahim Ihsan Demeulemeester, Jonas Flerin, Christopher Mauduit, David Minnoye, Liesbeth Hulselmans, Gert Christiaens, Valerie Ghanem, Ghanem-Elias Wouters, Jasper Aerts, Stein Genome Res Method Genomic sequence variation within enhancers and promoters can have a significant impact on the cellular state and phenotype. However, sifting through the millions of candidate variants in a personal genome or a cancer genome, to identify those that impact cis-regulatory function, remains a major challenge. Interpretation of noncoding genome variation benefits from explainable artificial intelligence to predict and interpret the impact of a mutation on gene regulation. Here we generate phased whole genomes with matched chromatin accessibility, histone modifications, and gene expression for 10 melanoma cell lines. We find that training a specialized deep learning model, called DeepMEL2, on melanoma chromatin accessibility data can capture the various regulatory programs of the melanocytic and mesenchymal-like melanoma cell states. This model outperforms motif-based variant scoring, as well as more generic deep learning models. We detect hundreds to thousands of allele-specific chromatin accessibility variants (ASCAVs) in each melanoma genome, of which 15%–20% can be explained by gains or losses of transcription factor binding sites. A considerable fraction of ASCAVs are caused by changes in AP-1 binding, as confirmed by matched ChIP-seq data to identify allele-specific binding of JUN and FOSL1. Finally, by augmenting the DeepMEL2 model with ChIP-seq data for GABPA, the TERT promoter mutation, as well as additional ETS motif gains, can be identified with high confidence. In conclusion, we present a new integrative genomics approach and a deep learning model to identify and interpret functional enhancer mutations with allelic imbalance of chromatin accessibility and gene expression. Cold Spring Harbor Laboratory Press 2021-06 /pmc/articles/PMC8168584/ /pubmed/33832990 http://dx.doi.org/10.1101/gr.260851.120 Text en © 2021 Atak et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by/4.0/This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Method Atak, Zeynep Kalender Taskiran, Ibrahim Ihsan Demeulemeester, Jonas Flerin, Christopher Mauduit, David Minnoye, Liesbeth Hulselmans, Gert Christiaens, Valerie Ghanem, Ghanem-Elias Wouters, Jasper Aerts, Stein Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning |
title | Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning |
title_full | Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning |
title_fullStr | Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning |
title_full_unstemmed | Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning |
title_short | Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning |
title_sort | interpretation of allele-specific chromatin accessibility using cell state–aware deep learning |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8168584/ https://www.ncbi.nlm.nih.gov/pubmed/33832990 http://dx.doi.org/10.1101/gr.260851.120 |
work_keys_str_mv | AT atakzeynepkalender interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT taskiranibrahimihsan interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT demeulemeesterjonas interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT flerinchristopher interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT mauduitdavid interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT minnoyeliesbeth interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT hulselmansgert interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT christiaensvalerie interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT ghanemghanemelias interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT woutersjasper interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning AT aertsstein interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning |