Cargando…
preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution
MOTIVATION: Chromosome conformation capture technologies (Hi-C) revealed extensive DNA folding into discrete 3D domains, such as Topologically Associating Domains and chromatin loops. The correct binding of CTCF and cohesin at domain boundaries is integral in maintaining the proper structure and fun...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8756196/ https://www.ncbi.nlm.nih.gov/pubmed/34741515 http://dx.doi.org/10.1093/bioinformatics/btab743 |
_version_ | 1784632516578115584 |
---|---|
author | Stilianoudakis, Spiro C Marshall, Maggie A Dozmorov, Mikhail G |
author_facet | Stilianoudakis, Spiro C Marshall, Maggie A Dozmorov, Mikhail G |
author_sort | Stilianoudakis, Spiro C |
collection | PubMed |
description | MOTIVATION: Chromosome conformation capture technologies (Hi-C) revealed extensive DNA folding into discrete 3D domains, such as Topologically Associating Domains and chromatin loops. The correct binding of CTCF and cohesin at domain boundaries is integral in maintaining the proper structure and function of these 3D domains. 3D domains have been mapped at the resolutions of 1 kilobase and above. However, it has not been possible to define their boundaries at the resolution of boundary-forming proteins. RESULTS: To predict domain boundaries at base-pair resolution, we developed preciseTAD, an optimized transfer learning framework trained on high-resolution genome annotation data. In contrast to current TAD/loop callers, preciseTAD-predicted boundaries are strongly supported by experimental evidence. Importantly, this approach can accurately delineate boundaries in cells without Hi-C data. preciseTAD provides a powerful framework to improve our understanding of how genomic regulators are shaping the 3D structure of the genome at base-pair resolution. AVAILABILITY AND IMPLEMENTATION: preciseTAD is an R/Bioconductor package available at https://bioconductor.org/packages/preciseTAD/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8756196 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-87561962022-01-13 preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution Stilianoudakis, Spiro C Marshall, Maggie A Dozmorov, Mikhail G Bioinformatics Original Papers MOTIVATION: Chromosome conformation capture technologies (Hi-C) revealed extensive DNA folding into discrete 3D domains, such as Topologically Associating Domains and chromatin loops. The correct binding of CTCF and cohesin at domain boundaries is integral in maintaining the proper structure and function of these 3D domains. 3D domains have been mapped at the resolutions of 1 kilobase and above. However, it has not been possible to define their boundaries at the resolution of boundary-forming proteins. RESULTS: To predict domain boundaries at base-pair resolution, we developed preciseTAD, an optimized transfer learning framework trained on high-resolution genome annotation data. In contrast to current TAD/loop callers, preciseTAD-predicted boundaries are strongly supported by experimental evidence. Importantly, this approach can accurately delineate boundaries in cells without Hi-C data. preciseTAD provides a powerful framework to improve our understanding of how genomic regulators are shaping the 3D structure of the genome at base-pair resolution. AVAILABILITY AND IMPLEMENTATION: preciseTAD is an R/Bioconductor package available at https://bioconductor.org/packages/preciseTAD/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-11-06 /pmc/articles/PMC8756196/ /pubmed/34741515 http://dx.doi.org/10.1093/bioinformatics/btab743 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Stilianoudakis, Spiro C Marshall, Maggie A Dozmorov, Mikhail G preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution |
title | preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution |
title_full | preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution |
title_fullStr | preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution |
title_full_unstemmed | preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution |
title_short | preciseTAD: a transfer learning framework for 3D domain boundary prediction at base-pair resolution |
title_sort | precisetad: a transfer learning framework for 3d domain boundary prediction at base-pair resolution |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8756196/ https://www.ncbi.nlm.nih.gov/pubmed/34741515 http://dx.doi.org/10.1093/bioinformatics/btab743 |
work_keys_str_mv | AT stilianoudakisspiroc precisetadatransferlearningframeworkfor3ddomainboundarypredictionatbasepairresolution AT marshallmaggiea precisetadatransferlearningframeworkfor3ddomainboundarypredictionatbasepairresolution AT dozmorovmikhailg precisetadatransferlearningframeworkfor3ddomainboundarypredictionatbasepairresolution |