Cargando…
TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data
Topologically associating domains (TADs) are fundamental building blocks of three dimensional genome, and organized into complex hierarchies. Identifying hierarchical TADs on Hi-C data helps to understand the relationship between genome architectures and gene regulation. Herein we propose TADfit, a...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9209495/ https://www.ncbi.nlm.nih.gov/pubmed/35725901 http://dx.doi.org/10.1038/s42003-022-03546-y |
_version_ | 1784729968538812416 |
---|---|
author | Liu, Erhu Lyu, Hongqiang Peng, Qinke Liu, Yuan Wang, Tian Han, Jiuqiang |
author_facet | Liu, Erhu Lyu, Hongqiang Peng, Qinke Liu, Yuan Wang, Tian Han, Jiuqiang |
author_sort | Liu, Erhu |
collection | PubMed |
description | Topologically associating domains (TADs) are fundamental building blocks of three dimensional genome, and organized into complex hierarchies. Identifying hierarchical TADs on Hi-C data helps to understand the relationship between genome architectures and gene regulation. Herein we propose TADfit, a multivariate linear regression model for profiling hierarchical chromatin domains, which tries to fit the interaction frequencies in Hi-C contact matrix with and without replicates using all-possible hierarchical TADs, and the significant ones can be determined by the regression coefficients obtained with the help of an online learning solver called Follow-The-Regularized-Leader (FTRL). Beyond the existing methods, TADfit has an ability to handle multiple contact matrix replicates and find partially overlapping TADs on them, which helps to find the comprehensive underlying TADs across replicates from different experiments. The comparative results tell that TADfit has better accuracy and reproducibility, and the hierarchical TADs called by it exhibit a reasonable biological relevance. |
format | Online Article Text |
id | pubmed-9209495 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-92094952022-06-22 TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data Liu, Erhu Lyu, Hongqiang Peng, Qinke Liu, Yuan Wang, Tian Han, Jiuqiang Commun Biol Article Topologically associating domains (TADs) are fundamental building blocks of three dimensional genome, and organized into complex hierarchies. Identifying hierarchical TADs on Hi-C data helps to understand the relationship between genome architectures and gene regulation. Herein we propose TADfit, a multivariate linear regression model for profiling hierarchical chromatin domains, which tries to fit the interaction frequencies in Hi-C contact matrix with and without replicates using all-possible hierarchical TADs, and the significant ones can be determined by the regression coefficients obtained with the help of an online learning solver called Follow-The-Regularized-Leader (FTRL). Beyond the existing methods, TADfit has an ability to handle multiple contact matrix replicates and find partially overlapping TADs on them, which helps to find the comprehensive underlying TADs across replicates from different experiments. The comparative results tell that TADfit has better accuracy and reproducibility, and the hierarchical TADs called by it exhibit a reasonable biological relevance. Nature Publishing Group UK 2022-06-20 /pmc/articles/PMC9209495/ /pubmed/35725901 http://dx.doi.org/10.1038/s42003-022-03546-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Liu, Erhu Lyu, Hongqiang Peng, Qinke Liu, Yuan Wang, Tian Han, Jiuqiang TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data |
title | TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data |
title_full | TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data |
title_fullStr | TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data |
title_full_unstemmed | TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data |
title_short | TADfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate Hi-C data |
title_sort | tadfit is a multivariate linear regression model for profiling hierarchical chromatin domains on replicate hi-c data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9209495/ https://www.ncbi.nlm.nih.gov/pubmed/35725901 http://dx.doi.org/10.1038/s42003-022-03546-y |
work_keys_str_mv | AT liuerhu tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata AT lyuhongqiang tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata AT pengqinke tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata AT liuyuan tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata AT wangtian tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata AT hanjiuqiang tadfitisamultivariatelinearregressionmodelforprofilinghierarchicalchromatindomainsonreplicatehicdata |