Cargando…

Predicting 3D chromatin interactions from DNA sequence using Deep Learning

Gene regulation in eukaryotes is profoundly shaped by the 3D organization of chromatin within the cell nucleus. Distal regulatory interactions between enhancers and their target genes are widespread and many causal loci underlying heritable agricultural or clinical traits have been mapped to distal...

Descripción completa

Detalles Bibliográficos
Autores principales: Piecyk, Robert S., Schlegel, Luca, Johannes, Frank
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9271978/
https://www.ncbi.nlm.nih.gov/pubmed/35832620
http://dx.doi.org/10.1016/j.csbj.2022.06.047
_version_ 1784744794309787648
author Piecyk, Robert S.
Schlegel, Luca
Johannes, Frank
author_facet Piecyk, Robert S.
Schlegel, Luca
Johannes, Frank
author_sort Piecyk, Robert S.
collection PubMed
description Gene regulation in eukaryotes is profoundly shaped by the 3D organization of chromatin within the cell nucleus. Distal regulatory interactions between enhancers and their target genes are widespread and many causal loci underlying heritable agricultural or clinical traits have been mapped to distal cis-regulatory elements. Dissecting the sequence features that mediate such distal interactions is key to understanding their underlying biology. Deep Learning (DL) models coupled with genome-wide 3C-based sequencing data have emerged as powerful tools to infer the DNA sequence grammar underlying such distal interactions. In this review we show that most DL models have remarkably high prediction accuracy, which indicates that DNA sequence features are important determinants of chromatin looping. However, DL model training has so far been limited to a small set of human cell lines, raising questions about the generalization of these predictions to other tissue-types and species. Furthermore, we find that the model architecture seems less relevant for model performance than the training strategy and the data preparation step. Transfer learning, coupled with functionally curated interactions, appear to be the most promising approach to learn cell-type specific and possibly species- specific sequence features in future applications.
format Online
Article
Text
id pubmed-9271978
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-92719782022-07-12 Predicting 3D chromatin interactions from DNA sequence using Deep Learning Piecyk, Robert S. Schlegel, Luca Johannes, Frank Comput Struct Biotechnol J Mini Review Gene regulation in eukaryotes is profoundly shaped by the 3D organization of chromatin within the cell nucleus. Distal regulatory interactions between enhancers and their target genes are widespread and many causal loci underlying heritable agricultural or clinical traits have been mapped to distal cis-regulatory elements. Dissecting the sequence features that mediate such distal interactions is key to understanding their underlying biology. Deep Learning (DL) models coupled with genome-wide 3C-based sequencing data have emerged as powerful tools to infer the DNA sequence grammar underlying such distal interactions. In this review we show that most DL models have remarkably high prediction accuracy, which indicates that DNA sequence features are important determinants of chromatin looping. However, DL model training has so far been limited to a small set of human cell lines, raising questions about the generalization of these predictions to other tissue-types and species. Furthermore, we find that the model architecture seems less relevant for model performance than the training strategy and the data preparation step. Transfer learning, coupled with functionally curated interactions, appear to be the most promising approach to learn cell-type specific and possibly species- specific sequence features in future applications. Research Network of Computational and Structural Biotechnology 2022-06-25 /pmc/articles/PMC9271978/ /pubmed/35832620 http://dx.doi.org/10.1016/j.csbj.2022.06.047 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Mini Review
Piecyk, Robert S.
Schlegel, Luca
Johannes, Frank
Predicting 3D chromatin interactions from DNA sequence using Deep Learning
title Predicting 3D chromatin interactions from DNA sequence using Deep Learning
title_full Predicting 3D chromatin interactions from DNA sequence using Deep Learning
title_fullStr Predicting 3D chromatin interactions from DNA sequence using Deep Learning
title_full_unstemmed Predicting 3D chromatin interactions from DNA sequence using Deep Learning
title_short Predicting 3D chromatin interactions from DNA sequence using Deep Learning
title_sort predicting 3d chromatin interactions from dna sequence using deep learning
topic Mini Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9271978/
https://www.ncbi.nlm.nih.gov/pubmed/35832620
http://dx.doi.org/10.1016/j.csbj.2022.06.047
work_keys_str_mv AT piecykroberts predicting3dchromatininteractionsfromdnasequenceusingdeeplearning
AT schlegelluca predicting3dchromatininteractionsfromdnasequenceusingdeeplearning
AT johannesfrank predicting3dchromatininteractionsfromdnasequenceusingdeeplearning