Cargando…

Predicting 3D genome folding from DNA sequence with Akita

In interphase, the human genome sequence folds in three dimensions into a rich variety of locus-specific contact patterns. Cohesin and CTCF are key regulators; perturbing the levels of either greatly disrupts genome-wide folding as assayed by chromosome conformation capture methods. Still, how a giv...

Descripción completa

Detalles Bibliográficos
Autores principales: Fudenberg, Geoff, Kelley, David R., Pollard, Katherine S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211359/
https://www.ncbi.nlm.nih.gov/pubmed/33046897
http://dx.doi.org/10.1038/s41592-020-0958-x
_version_ 1783709461192376320
author Fudenberg, Geoff
Kelley, David R.
Pollard, Katherine S.
author_facet Fudenberg, Geoff
Kelley, David R.
Pollard, Katherine S.
author_sort Fudenberg, Geoff
collection PubMed
description In interphase, the human genome sequence folds in three dimensions into a rich variety of locus-specific contact patterns. Cohesin and CTCF are key regulators; perturbing the levels of either greatly disrupts genome-wide folding as assayed by chromosome conformation capture methods. Still, how a given DNA sequence encodes a particular locus-specific folding pattern remains unknown. Here we present a convolutional neural network, Akita, that accurately predicts genome folding from DNA sequence alone. Representations learned by Akita underscore the importance of an orientation-specific grammar for CTCF binding sites. Akita learns predictive nucleotide-level features of genome folding, revealing impacts of nucleotides beyond the core CTCF motif. Once trained, Akita enables rapid in silico predictions. Leveraging this, we demonstrate how Akita can be used to perform in silico saturation mutagenesis, interpret eQTLs, make predictions for structural variants, and probe species-specific genome folding. Collectively, these results enable decoding genome function from sequence through structure.
format Online
Article
Text
id pubmed-8211359
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-82113592021-06-17 Predicting 3D genome folding from DNA sequence with Akita Fudenberg, Geoff Kelley, David R. Pollard, Katherine S. Nat Methods Article In interphase, the human genome sequence folds in three dimensions into a rich variety of locus-specific contact patterns. Cohesin and CTCF are key regulators; perturbing the levels of either greatly disrupts genome-wide folding as assayed by chromosome conformation capture methods. Still, how a given DNA sequence encodes a particular locus-specific folding pattern remains unknown. Here we present a convolutional neural network, Akita, that accurately predicts genome folding from DNA sequence alone. Representations learned by Akita underscore the importance of an orientation-specific grammar for CTCF binding sites. Akita learns predictive nucleotide-level features of genome folding, revealing impacts of nucleotides beyond the core CTCF motif. Once trained, Akita enables rapid in silico predictions. Leveraging this, we demonstrate how Akita can be used to perform in silico saturation mutagenesis, interpret eQTLs, make predictions for structural variants, and probe species-specific genome folding. Collectively, these results enable decoding genome function from sequence through structure. 2020-10-12 2020-11 /pmc/articles/PMC8211359/ /pubmed/33046897 http://dx.doi.org/10.1038/s41592-020-0958-x Text en http://www.nature.com/authors/editorial_policies/license.html#termsUsers may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Fudenberg, Geoff
Kelley, David R.
Pollard, Katherine S.
Predicting 3D genome folding from DNA sequence with Akita
title Predicting 3D genome folding from DNA sequence with Akita
title_full Predicting 3D genome folding from DNA sequence with Akita
title_fullStr Predicting 3D genome folding from DNA sequence with Akita
title_full_unstemmed Predicting 3D genome folding from DNA sequence with Akita
title_short Predicting 3D genome folding from DNA sequence with Akita
title_sort predicting 3d genome folding from dna sequence with akita
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211359/
https://www.ncbi.nlm.nih.gov/pubmed/33046897
http://dx.doi.org/10.1038/s41592-020-0958-x
work_keys_str_mv AT fudenberggeoff predicting3dgenomefoldingfromdnasequencewithakita
AT kelleydavidr predicting3dgenomefoldingfromdnasequencewithakita
AT pollardkatherines predicting3dgenomefoldingfromdnasequencewithakita