Cargando…

TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering()

Topologically associated domains (TADs) play a pivotal role in disease detection. This study introduces a novel TADs recognition approach named TOAST, leveraging graph auto-encoders and clustering techniques. TOAST conceptualizes each genomic bin as a node of a graph and employs the Hi-C contact mat...

Descripción completa

Detalles Bibliográficos
Autores principales: Gong, Haiyan, Zhang, Dawei, Zhang, Xiaotong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562672/
https://www.ncbi.nlm.nih.gov/pubmed/37822562
http://dx.doi.org/10.1016/j.csbj.2023.09.019
_version_ 1785118179807199232
author Gong, Haiyan
Zhang, Dawei
Zhang, Xiaotong
author_facet Gong, Haiyan
Zhang, Dawei
Zhang, Xiaotong
author_sort Gong, Haiyan
collection PubMed
description Topologically associated domains (TADs) play a pivotal role in disease detection. This study introduces a novel TADs recognition approach named TOAST, leveraging graph auto-encoders and clustering techniques. TOAST conceptualizes each genomic bin as a node of a graph and employs the Hi-C contact matrix as the graph's adjacency matrix. By employing graph auto-encoders, TOAST generates informative embeddings as features. Subsequently, the unsupervised clustering algorithm HDBSCAN is utilized to assign labels to each genomic bin, facilitating the identification of contiguous regions with the same label as TADs. Our experimental analysis of several simulated Hi-C data sets shows that TOAST can quickly and accurately identify TADs from different types of simulated Hi-C contact matrices, outperforming existing algorithms. We also determined the anchoring ratio of TAD boundaries by analyzing different TAD recognition algorithms, and obtained an average ratio of anchoring CTCF, SMC3, RAD21, POLR2A, H3K36me3, H3K9me3, H3K4me3, H3K4me1, Enhancer, and Promoters of 0.66, 0.47, 0.54, 0.27, 0.24, 0.12, 0.32, 0.41, 0.26, and 0.13, respectively. In conclusion, TOAST is a method that can quickly identify TAD boundary parameters that are easy to understand and have important biological significance. The TOAST web server can be accessed via http://223.223.185.189:4005/. The code of TOAST is available online at https://github.com/ghaiyan/TOAST.
format Online
Article
Text
id pubmed-10562672
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-105626722023-10-11 TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering() Gong, Haiyan Zhang, Dawei Zhang, Xiaotong Comput Struct Biotechnol J Method Article Topologically associated domains (TADs) play a pivotal role in disease detection. This study introduces a novel TADs recognition approach named TOAST, leveraging graph auto-encoders and clustering techniques. TOAST conceptualizes each genomic bin as a node of a graph and employs the Hi-C contact matrix as the graph's adjacency matrix. By employing graph auto-encoders, TOAST generates informative embeddings as features. Subsequently, the unsupervised clustering algorithm HDBSCAN is utilized to assign labels to each genomic bin, facilitating the identification of contiguous regions with the same label as TADs. Our experimental analysis of several simulated Hi-C data sets shows that TOAST can quickly and accurately identify TADs from different types of simulated Hi-C contact matrices, outperforming existing algorithms. We also determined the anchoring ratio of TAD boundaries by analyzing different TAD recognition algorithms, and obtained an average ratio of anchoring CTCF, SMC3, RAD21, POLR2A, H3K36me3, H3K9me3, H3K4me3, H3K4me1, Enhancer, and Promoters of 0.66, 0.47, 0.54, 0.27, 0.24, 0.12, 0.32, 0.41, 0.26, and 0.13, respectively. In conclusion, TOAST is a method that can quickly identify TAD boundary parameters that are easy to understand and have important biological significance. The TOAST web server can be accessed via http://223.223.185.189:4005/. The code of TOAST is available online at https://github.com/ghaiyan/TOAST. Research Network of Computational and Structural Biotechnology 2023-09-27 /pmc/articles/PMC10562672/ /pubmed/37822562 http://dx.doi.org/10.1016/j.csbj.2023.09.019 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Method Article
Gong, Haiyan
Zhang, Dawei
Zhang, Xiaotong
TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering()
title TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering()
title_full TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering()
title_fullStr TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering()
title_full_unstemmed TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering()
title_short TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering()
title_sort toast: a novel method for identifying topologically associated domains based on graph auto-encoders and clustering()
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562672/
https://www.ncbi.nlm.nih.gov/pubmed/37822562
http://dx.doi.org/10.1016/j.csbj.2023.09.019
work_keys_str_mv AT gonghaiyan toastanovelmethodforidentifyingtopologicallyassociateddomainsbasedongraphautoencodersandclustering
AT zhangdawei toastanovelmethodforidentifyingtopologicallyassociateddomainsbasedongraphautoencodersandclustering
AT zhangxiaotong toastanovelmethodforidentifyingtopologicallyassociateddomainsbasedongraphautoencodersandclustering