Cargando…
Predicting 3D genome folding from DNA sequence with Akita
In interphase, the human genome sequence folds in three dimensions into a rich variety of locus-specific contact patterns. Cohesin and CTCF are key regulators; perturbing the levels of either greatly disrupts genome-wide folding as assayed by chromosome conformation capture methods. Still, how a giv...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211359/ https://www.ncbi.nlm.nih.gov/pubmed/33046897 http://dx.doi.org/10.1038/s41592-020-0958-x |
_version_ | 1783709461192376320 |
---|---|
author | Fudenberg, Geoff Kelley, David R. Pollard, Katherine S. |
author_facet | Fudenberg, Geoff Kelley, David R. Pollard, Katherine S. |
author_sort | Fudenberg, Geoff |
collection | PubMed |
description | In interphase, the human genome sequence folds in three dimensions into a rich variety of locus-specific contact patterns. Cohesin and CTCF are key regulators; perturbing the levels of either greatly disrupts genome-wide folding as assayed by chromosome conformation capture methods. Still, how a given DNA sequence encodes a particular locus-specific folding pattern remains unknown. Here we present a convolutional neural network, Akita, that accurately predicts genome folding from DNA sequence alone. Representations learned by Akita underscore the importance of an orientation-specific grammar for CTCF binding sites. Akita learns predictive nucleotide-level features of genome folding, revealing impacts of nucleotides beyond the core CTCF motif. Once trained, Akita enables rapid in silico predictions. Leveraging this, we demonstrate how Akita can be used to perform in silico saturation mutagenesis, interpret eQTLs, make predictions for structural variants, and probe species-specific genome folding. Collectively, these results enable decoding genome function from sequence through structure. |
format | Online Article Text |
id | pubmed-8211359 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-82113592021-06-17 Predicting 3D genome folding from DNA sequence with Akita Fudenberg, Geoff Kelley, David R. Pollard, Katherine S. Nat Methods Article In interphase, the human genome sequence folds in three dimensions into a rich variety of locus-specific contact patterns. Cohesin and CTCF are key regulators; perturbing the levels of either greatly disrupts genome-wide folding as assayed by chromosome conformation capture methods. Still, how a given DNA sequence encodes a particular locus-specific folding pattern remains unknown. Here we present a convolutional neural network, Akita, that accurately predicts genome folding from DNA sequence alone. Representations learned by Akita underscore the importance of an orientation-specific grammar for CTCF binding sites. Akita learns predictive nucleotide-level features of genome folding, revealing impacts of nucleotides beyond the core CTCF motif. Once trained, Akita enables rapid in silico predictions. Leveraging this, we demonstrate how Akita can be used to perform in silico saturation mutagenesis, interpret eQTLs, make predictions for structural variants, and probe species-specific genome folding. Collectively, these results enable decoding genome function from sequence through structure. 2020-10-12 2020-11 /pmc/articles/PMC8211359/ /pubmed/33046897 http://dx.doi.org/10.1038/s41592-020-0958-x Text en http://www.nature.com/authors/editorial_policies/license.html#termsUsers may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms |
spellingShingle | Article Fudenberg, Geoff Kelley, David R. Pollard, Katherine S. Predicting 3D genome folding from DNA sequence with Akita |
title | Predicting 3D genome folding from DNA sequence with Akita |
title_full | Predicting 3D genome folding from DNA sequence with Akita |
title_fullStr | Predicting 3D genome folding from DNA sequence with Akita |
title_full_unstemmed | Predicting 3D genome folding from DNA sequence with Akita |
title_short | Predicting 3D genome folding from DNA sequence with Akita |
title_sort | predicting 3d genome folding from dna sequence with akita |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211359/ https://www.ncbi.nlm.nih.gov/pubmed/33046897 http://dx.doi.org/10.1038/s41592-020-0958-x |
work_keys_str_mv | AT fudenberggeoff predicting3dgenomefoldingfromdnasequencewithakita AT kelleydavidr predicting3dgenomefoldingfromdnasequencewithakita AT pollardkatherines predicting3dgenomefoldingfromdnasequencewithakita |