Cargando…

GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data

Graph and image are two common representations of Hi-C cis-contact maps. Existing computational tools have only adopted Hi-C data modeled as unitary data structures but neglected the potential advantages of synergizing the information of different views. Here we propose GILoop, a dual-branch neural...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Fuzhou, Gao, Tingxiao, Lin, Jiecong, Zheng, Zetian, Huang, Lei, Toseef, Muhammad, Li, Xiangtao, Wong, Ka-Chun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9700007/
https://www.ncbi.nlm.nih.gov/pubmed/36444296
http://dx.doi.org/10.1016/j.isci.2022.105535
_version_ 1784839211052957696
author Wang, Fuzhou
Gao, Tingxiao
Lin, Jiecong
Zheng, Zetian
Huang, Lei
Toseef, Muhammad
Li, Xiangtao
Wong, Ka-Chun
author_facet Wang, Fuzhou
Gao, Tingxiao
Lin, Jiecong
Zheng, Zetian
Huang, Lei
Toseef, Muhammad
Li, Xiangtao
Wong, Ka-Chun
author_sort Wang, Fuzhou
collection PubMed
description Graph and image are two common representations of Hi-C cis-contact maps. Existing computational tools have only adopted Hi-C data modeled as unitary data structures but neglected the potential advantages of synergizing the information of different views. Here we propose GILoop, a dual-branch neural network that learns from both representations to identify genome-wide CTCF-mediated loops. With GILoop, we explore the combined strength of integrating the two view representations of Hi-C data and corroborate the complementary relationship between the views. In particular, the model outperforms the state-of-the-art loop calling framework and is also more robust against low-quality Hi-C libraries. We also uncover distinct preferences for matrix density by graph-based and image-based models, revealing interesting insights into Hi-C data elucidation. Finally, along with multiple transfer-learning case studies, we demonstrate that GILoop can accurately model the organizational and functional patterns of CTCF-mediated looping across different cell lines.
format Online
Article
Text
id pubmed-9700007
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-97000072022-11-27 GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data Wang, Fuzhou Gao, Tingxiao Lin, Jiecong Zheng, Zetian Huang, Lei Toseef, Muhammad Li, Xiangtao Wong, Ka-Chun iScience Article Graph and image are two common representations of Hi-C cis-contact maps. Existing computational tools have only adopted Hi-C data modeled as unitary data structures but neglected the potential advantages of synergizing the information of different views. Here we propose GILoop, a dual-branch neural network that learns from both representations to identify genome-wide CTCF-mediated loops. With GILoop, we explore the combined strength of integrating the two view representations of Hi-C data and corroborate the complementary relationship between the views. In particular, the model outperforms the state-of-the-art loop calling framework and is also more robust against low-quality Hi-C libraries. We also uncover distinct preferences for matrix density by graph-based and image-based models, revealing interesting insights into Hi-C data elucidation. Finally, along with multiple transfer-learning case studies, we demonstrate that GILoop can accurately model the organizational and functional patterns of CTCF-mediated looping across different cell lines. Elsevier 2022-11-10 /pmc/articles/PMC9700007/ /pubmed/36444296 http://dx.doi.org/10.1016/j.isci.2022.105535 Text en © 2022. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Wang, Fuzhou
Gao, Tingxiao
Lin, Jiecong
Zheng, Zetian
Huang, Lei
Toseef, Muhammad
Li, Xiangtao
Wong, Ka-Chun
GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data
title GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data
title_full GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data
title_fullStr GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data
title_full_unstemmed GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data
title_short GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data
title_sort giloop: robust chromatin loop calling across multiple sequencing depths on hi-c data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9700007/
https://www.ncbi.nlm.nih.gov/pubmed/36444296
http://dx.doi.org/10.1016/j.isci.2022.105535
work_keys_str_mv AT wangfuzhou gilooprobustchromatinloopcallingacrossmultiplesequencingdepthsonhicdata
AT gaotingxiao gilooprobustchromatinloopcallingacrossmultiplesequencingdepthsonhicdata
AT linjiecong gilooprobustchromatinloopcallingacrossmultiplesequencingdepthsonhicdata
AT zhengzetian gilooprobustchromatinloopcallingacrossmultiplesequencingdepthsonhicdata
AT huanglei gilooprobustchromatinloopcallingacrossmultiplesequencingdepthsonhicdata
AT toseefmuhammad gilooprobustchromatinloopcallingacrossmultiplesequencingdepthsonhicdata
AT lixiangtao gilooprobustchromatinloopcallingacrossmultiplesequencingdepthsonhicdata
AT wongkachun gilooprobustchromatinloopcallingacrossmultiplesequencingdepthsonhicdata