Cargando…
Predicting 3D chromatin interactions from DNA sequence using Deep Learning
Gene regulation in eukaryotes is profoundly shaped by the 3D organization of chromatin within the cell nucleus. Distal regulatory interactions between enhancers and their target genes are widespread and many causal loci underlying heritable agricultural or clinical traits have been mapped to distal...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9271978/ https://www.ncbi.nlm.nih.gov/pubmed/35832620 http://dx.doi.org/10.1016/j.csbj.2022.06.047 |
_version_ | 1784744794309787648 |
---|---|
author | Piecyk, Robert S. Schlegel, Luca Johannes, Frank |
author_facet | Piecyk, Robert S. Schlegel, Luca Johannes, Frank |
author_sort | Piecyk, Robert S. |
collection | PubMed |
description | Gene regulation in eukaryotes is profoundly shaped by the 3D organization of chromatin within the cell nucleus. Distal regulatory interactions between enhancers and their target genes are widespread and many causal loci underlying heritable agricultural or clinical traits have been mapped to distal cis-regulatory elements. Dissecting the sequence features that mediate such distal interactions is key to understanding their underlying biology. Deep Learning (DL) models coupled with genome-wide 3C-based sequencing data have emerged as powerful tools to infer the DNA sequence grammar underlying such distal interactions. In this review we show that most DL models have remarkably high prediction accuracy, which indicates that DNA sequence features are important determinants of chromatin looping. However, DL model training has so far been limited to a small set of human cell lines, raising questions about the generalization of these predictions to other tissue-types and species. Furthermore, we find that the model architecture seems less relevant for model performance than the training strategy and the data preparation step. Transfer learning, coupled with functionally curated interactions, appear to be the most promising approach to learn cell-type specific and possibly species- specific sequence features in future applications. |
format | Online Article Text |
id | pubmed-9271978 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-92719782022-07-12 Predicting 3D chromatin interactions from DNA sequence using Deep Learning Piecyk, Robert S. Schlegel, Luca Johannes, Frank Comput Struct Biotechnol J Mini Review Gene regulation in eukaryotes is profoundly shaped by the 3D organization of chromatin within the cell nucleus. Distal regulatory interactions between enhancers and their target genes are widespread and many causal loci underlying heritable agricultural or clinical traits have been mapped to distal cis-regulatory elements. Dissecting the sequence features that mediate such distal interactions is key to understanding their underlying biology. Deep Learning (DL) models coupled with genome-wide 3C-based sequencing data have emerged as powerful tools to infer the DNA sequence grammar underlying such distal interactions. In this review we show that most DL models have remarkably high prediction accuracy, which indicates that DNA sequence features are important determinants of chromatin looping. However, DL model training has so far been limited to a small set of human cell lines, raising questions about the generalization of these predictions to other tissue-types and species. Furthermore, we find that the model architecture seems less relevant for model performance than the training strategy and the data preparation step. Transfer learning, coupled with functionally curated interactions, appear to be the most promising approach to learn cell-type specific and possibly species- specific sequence features in future applications. Research Network of Computational and Structural Biotechnology 2022-06-25 /pmc/articles/PMC9271978/ /pubmed/35832620 http://dx.doi.org/10.1016/j.csbj.2022.06.047 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Mini Review Piecyk, Robert S. Schlegel, Luca Johannes, Frank Predicting 3D chromatin interactions from DNA sequence using Deep Learning |
title | Predicting 3D chromatin interactions from DNA sequence using Deep Learning |
title_full | Predicting 3D chromatin interactions from DNA sequence using Deep Learning |
title_fullStr | Predicting 3D chromatin interactions from DNA sequence using Deep Learning |
title_full_unstemmed | Predicting 3D chromatin interactions from DNA sequence using Deep Learning |
title_short | Predicting 3D chromatin interactions from DNA sequence using Deep Learning |
title_sort | predicting 3d chromatin interactions from dna sequence using deep learning |
topic | Mini Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9271978/ https://www.ncbi.nlm.nih.gov/pubmed/35832620 http://dx.doi.org/10.1016/j.csbj.2022.06.047 |
work_keys_str_mv | AT piecykroberts predicting3dchromatininteractionsfromdnasequenceusingdeeplearning AT schlegelluca predicting3dchromatininteractionsfromdnasequenceusingdeeplearning AT johannesfrank predicting3dchromatininteractionsfromdnasequenceusingdeeplearning |