Cargando…

TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data

Topologically associating domains (TADs) are fundamental building blocks of three dimensional genome, and organized into complex hierarchies. Identifying hierarchical TADs on Hi-C data helps to understand the relationship between genome architectures and gene regulation. Herein we propose TADfit, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Erhu, Lyu, Hongqiang, Peng, Qinke, Liu, Yuan, Wang, Tian, Han, Jiuqiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9209495/
https://www.ncbi.nlm.nih.gov/pubmed/35725901
http://dx.doi.org/10.1038/s42003-022-03546-y
_version_ 1784729968538812416
author Liu, Erhu
Lyu, Hongqiang
Peng, Qinke
Liu, Yuan
Wang, Tian
Han, Jiuqiang
author_facet Liu, Erhu
Lyu, Hongqiang
Peng, Qinke
Liu, Yuan
Wang, Tian
Han, Jiuqiang
author_sort Liu, Erhu
collection PubMed
description Topologically associating domains (TADs) are fundamental building blocks of three dimensional genome, and organized into complex hierarchies. Identifying hierarchical TADs on Hi-C data helps to understand the relationship between genome architectures and gene regulation. Herein we propose TADfit, a multivariate linear regression model for profiling hierarchical chromatin domains, which tries to fit the interaction frequencies in Hi-C contact matrix with and without replicates using all-possible hierarchical TADs, and the significant ones can be determined by the regression coefficients obtained with the help of an online learning solver called Follow-The-Regularized-Leader (FTRL). Beyond the existing methods, TADfit has an ability to handle multiple contact matrix replicates and find partially overlapping TADs on them, which helps to find the comprehensive underlying TADs across replicates from different experiments. The comparative results tell that TADfit has better accuracy and reproducibility, and the hierarchical TADs called by it exhibit a reasonable biological relevance.
format Online
Article
Text
id pubmed-9209495
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92094952022-06-22 TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data Liu, Erhu Lyu, Hongqiang Peng, Qinke Liu, Yuan Wang, Tian Han, Jiuqiang Commun Biol Article Topologically associating domains (TADs) are fundamental building blocks of three dimensional genome, and organized into complex hierarchies. Identifying hierarchical TADs on Hi-C data helps to understand the relationship between genome architectures and gene regulation. Herein we propose TADfit, a multivariate linear regression model for profiling hierarchical chromatin domains, which tries to fit the interaction frequencies in Hi-C contact matrix with and without replicates using all-possible hierarchical TADs, and the significant ones can be determined by the regression coefficients obtained with the help of an online learning solver called Follow-The-Regularized-Leader (FTRL). Beyond the existing methods, TADfit has an ability to handle multiple contact matrix replicates and find partially overlapping TADs on them, which helps to find the comprehensive underlying TADs across replicates from different experiments. The comparative results tell that TADfit has better accuracy and reproducibility, and the hierarchical TADs called by it exhibit a reasonable biological relevance. Nature Publishing Group UK 2022-06-20 /pmc/articles/PMC9209495/ /pubmed/35725901 http://dx.doi.org/10.1038/s42003-022-03546-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Liu, Erhu
Lyu, Hongqiang
Peng, Qinke
Liu, Yuan
Wang, Tian
Han, Jiuqiang
TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data
title TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data
title_full TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data
title_fullStr TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data
title_full_unstemmed TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data
title_short TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data
title_sort tadfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate hi-c data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9209495/
https://www.ncbi.nlm.nih.gov/pubmed/35725901
http://dx.doi.org/10.1038/s42003-022-03546-y
work_keys_str_mv AT liuerhu tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata
AT lyuhongqiang tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata
AT pengqinke tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata
AT liuyuan tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata
AT wangtian tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata
AT hanjiuqiang tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata