Cargando…

preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution

MOTIVATION: Chromosome conformation capture technologies (Hi-C) revealed extensive DNA folding into discrete 3D domains, such as Topologically Associating Domains and chromatin loops. The correct binding of CTCF and cohesin at domain boundaries is integral in maintaining the proper structure and fun...

Descripción completa

Detalles Bibliográficos
Autores principales: Stilianoudakis, Spiro C, Marshall, Maggie A, Dozmorov, Mikhail G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8756196/
https://www.ncbi.nlm.nih.gov/pubmed/34741515
http://dx.doi.org/10.1093/bioinformatics/btab743
_version_ 1784632516578115584
author Stilianoudakis, Spiro C
Marshall, Maggie A
Dozmorov, Mikhail G
author_facet Stilianoudakis, Spiro C
Marshall, Maggie A
Dozmorov, Mikhail G
author_sort Stilianoudakis, Spiro C
collection PubMed
description MOTIVATION: Chromosome conformation capture technologies (Hi-C) revealed extensive DNA folding into discrete 3D domains, such as Topologically Associating Domains and chromatin loops. The correct binding of CTCF and cohesin at domain boundaries is integral in maintaining the proper structure and function of these 3D domains. 3D domains have been mapped at the resolutions of 1 kilobase and above. However, it has not been possible to define their boundaries at the resolution of boundary-forming proteins. RESULTS: To predict domain boundaries at base-pair resolution, we developed preciseTAD, an optimized transfer learning framework trained on high-resolution genome annotation data. In contrast to current TAD/loop callers, preciseTAD-predicted boundaries are strongly supported by experimental evidence. Importantly, this approach can accurately delineate boundaries in cells without Hi-C data. preciseTAD provides a powerful framework to improve our understanding of how genomic regulators are shaping the 3D structure of the genome at base-pair resolution. AVAILABILITY AND IMPLEMENTATION: preciseTAD is an R/Bioconductor package available at https://bioconductor.org/packages/preciseTAD/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8756196
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-87561962022-01-13 preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution Stilianoudakis, Spiro C Marshall, Maggie A Dozmorov, Mikhail G Bioinformatics Original Papers MOTIVATION: Chromosome conformation capture technologies (Hi-C) revealed extensive DNA folding into discrete 3D domains, such as Topologically Associating Domains and chromatin loops. The correct binding of CTCF and cohesin at domain boundaries is integral in maintaining the proper structure and function of these 3D domains. 3D domains have been mapped at the resolutions of 1 kilobase and above. However, it has not been possible to define their boundaries at the resolution of boundary-forming proteins. RESULTS: To predict domain boundaries at base-pair resolution, we developed preciseTAD, an optimized transfer learning framework trained on high-resolution genome annotation data. In contrast to current TAD/loop callers, preciseTAD-predicted boundaries are strongly supported by experimental evidence. Importantly, this approach can accurately delineate boundaries in cells without Hi-C data. preciseTAD provides a powerful framework to improve our understanding of how genomic regulators are shaping the 3D structure of the genome at base-pair resolution. AVAILABILITY AND IMPLEMENTATION: preciseTAD is an R/Bioconductor package available at https://bioconductor.org/packages/preciseTAD/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-11-06 /pmc/articles/PMC8756196/ /pubmed/34741515 http://dx.doi.org/10.1093/bioinformatics/btab743 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Stilianoudakis, Spiro C
Marshall, Maggie A
Dozmorov, Mikhail G
preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution
title preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution
title_full preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution
title_fullStr preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution
title_full_unstemmed preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution
title_short preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution
title_sort precisetad: a transfer learning framework for 3d domain boundary prediction at base-pair resolution
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8756196/
https://www.ncbi.nlm.nih.gov/pubmed/34741515
http://dx.doi.org/10.1093/bioinformatics/btab743
work_keys_str_mv AT stilianoudakisspiroc precisetadatransferlearningframeworkfor3ddomainboundarypredictionatbasepairresolution
AT marshallmaggiea precisetadatransferlearningframeworkfor3ddomainboundarypredictionatbasepairresolution
AT dozmorovmikhailg precisetadatransferlearningframeworkfor3ddomainboundarypredictionatbasepairresolution