Cargando…

Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning

Genomic sequence variation within enhancers and promoters can have a significant impact on the cellular state and phenotype. However, sifting through the millions of candidate variants in a personal genome or a cancer genome, to identify those that impact cis-regulatory function, remains a major cha...

Descripción completa

Detalles Bibliográficos
Autores principales: Atak, Zeynep Kalender, Taskiran, Ibrahim Ihsan, Demeulemeester, Jonas, Flerin, Christopher, Mauduit, David, Minnoye, Liesbeth, Hulselmans, Gert, Christiaens, Valerie, Ghanem, Ghanem-Elias, Wouters, Jasper, Aerts, Stein
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8168584/
https://www.ncbi.nlm.nih.gov/pubmed/33832990
http://dx.doi.org/10.1101/gr.260851.120
_version_ 1783701904364142592
author Atak, Zeynep Kalender
Taskiran, Ibrahim Ihsan
Demeulemeester, Jonas
Flerin, Christopher
Mauduit, David
Minnoye, Liesbeth
Hulselmans, Gert
Christiaens, Valerie
Ghanem, Ghanem-Elias
Wouters, Jasper
Aerts, Stein
author_facet Atak, Zeynep Kalender
Taskiran, Ibrahim Ihsan
Demeulemeester, Jonas
Flerin, Christopher
Mauduit, David
Minnoye, Liesbeth
Hulselmans, Gert
Christiaens, Valerie
Ghanem, Ghanem-Elias
Wouters, Jasper
Aerts, Stein
author_sort Atak, Zeynep Kalender
collection PubMed
description Genomic sequence variation within enhancers and promoters can have a significant impact on the cellular state and phenotype. However, sifting through the millions of candidate variants in a personal genome or a cancer genome, to identify those that impact cis-regulatory function, remains a major challenge. Interpretation of noncoding genome variation benefits from explainable artificial intelligence to predict and interpret the impact of a mutation on gene regulation. Here we generate phased whole genomes with matched chromatin accessibility, histone modifications, and gene expression for 10 melanoma cell lines. We find that training a specialized deep learning model, called DeepMEL2, on melanoma chromatin accessibility data can capture the various regulatory programs of the melanocytic and mesenchymal-like melanoma cell states. This model outperforms motif-based variant scoring, as well as more generic deep learning models. We detect hundreds to thousands of allele-specific chromatin accessibility variants (ASCAVs) in each melanoma genome, of which 15%–20% can be explained by gains or losses of transcription factor binding sites. A considerable fraction of ASCAVs are caused by changes in AP-1 binding, as confirmed by matched ChIP-seq data to identify allele-specific binding of JUN and FOSL1. Finally, by augmenting the DeepMEL2 model with ChIP-seq data for GABPA, the TERT promoter mutation, as well as additional ETS motif gains, can be identified with high confidence. In conclusion, we present a new integrative genomics approach and a deep learning model to identify and interpret functional enhancer mutations with allelic imbalance of chromatin accessibility and gene expression.
format Online
Article
Text
id pubmed-8168584
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-81685842021-06-14 Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning Atak, Zeynep Kalender Taskiran, Ibrahim Ihsan Demeulemeester, Jonas Flerin, Christopher Mauduit, David Minnoye, Liesbeth Hulselmans, Gert Christiaens, Valerie Ghanem, Ghanem-Elias Wouters, Jasper Aerts, Stein Genome Res Method Genomic sequence variation within enhancers and promoters can have a significant impact on the cellular state and phenotype. However, sifting through the millions of candidate variants in a personal genome or a cancer genome, to identify those that impact cis-regulatory function, remains a major challenge. Interpretation of noncoding genome variation benefits from explainable artificial intelligence to predict and interpret the impact of a mutation on gene regulation. Here we generate phased whole genomes with matched chromatin accessibility, histone modifications, and gene expression for 10 melanoma cell lines. We find that training a specialized deep learning model, called DeepMEL2, on melanoma chromatin accessibility data can capture the various regulatory programs of the melanocytic and mesenchymal-like melanoma cell states. This model outperforms motif-based variant scoring, as well as more generic deep learning models. We detect hundreds to thousands of allele-specific chromatin accessibility variants (ASCAVs) in each melanoma genome, of which 15%–20% can be explained by gains or losses of transcription factor binding sites. A considerable fraction of ASCAVs are caused by changes in AP-1 binding, as confirmed by matched ChIP-seq data to identify allele-specific binding of JUN and FOSL1. Finally, by augmenting the DeepMEL2 model with ChIP-seq data for GABPA, the TERT promoter mutation, as well as additional ETS motif gains, can be identified with high confidence. In conclusion, we present a new integrative genomics approach and a deep learning model to identify and interpret functional enhancer mutations with allelic imbalance of chromatin accessibility and gene expression. Cold Spring Harbor Laboratory Press 2021-06 /pmc/articles/PMC8168584/ /pubmed/33832990 http://dx.doi.org/10.1101/gr.260851.120 Text en © 2021 Atak et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by/4.0/This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Method
Atak, Zeynep Kalender
Taskiran, Ibrahim Ihsan
Demeulemeester, Jonas
Flerin, Christopher
Mauduit, David
Minnoye, Liesbeth
Hulselmans, Gert
Christiaens, Valerie
Ghanem, Ghanem-Elias
Wouters, Jasper
Aerts, Stein
Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning
title Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning
title_full Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning
title_fullStr Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning
title_full_unstemmed Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning
title_short Interpretation of allele-specific chromatin accessibility using cell state–aware deep learning
title_sort interpretation of allele-specific chromatin accessibility using cell state–aware deep learning
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8168584/
https://www.ncbi.nlm.nih.gov/pubmed/33832990
http://dx.doi.org/10.1101/gr.260851.120
work_keys_str_mv AT atakzeynepkalender interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT taskiranibrahimihsan interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT demeulemeesterjonas interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT flerinchristopher interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT mauduitdavid interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT minnoyeliesbeth interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT hulselmansgert interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT christiaensvalerie interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT ghanemghanemelias interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT woutersjasper interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning
AT aertsstein interpretationofallelespecificchromatinaccessibilityusingcellstateawaredeeplearning