Cargando…
TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering()
Topologically associated domains (TADs) play a pivotal role in disease detection. This study introduces a novel TADs recognition approach named TOAST, leveraging graph auto-encoders and clustering techniques. TOAST conceptualizes each genomic bin as a node of a graph and employs the Hi-C contact mat...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562672/ https://www.ncbi.nlm.nih.gov/pubmed/37822562 http://dx.doi.org/10.1016/j.csbj.2023.09.019 |
_version_ | 1785118179807199232 |
---|---|
author | Gong, Haiyan Zhang, Dawei Zhang, Xiaotong |
author_facet | Gong, Haiyan Zhang, Dawei Zhang, Xiaotong |
author_sort | Gong, Haiyan |
collection | PubMed |
description | Topologically associated domains (TADs) play a pivotal role in disease detection. This study introduces a novel TADs recognition approach named TOAST, leveraging graph auto-encoders and clustering techniques. TOAST conceptualizes each genomic bin as a node of a graph and employs the Hi-C contact matrix as the graph's adjacency matrix. By employing graph auto-encoders, TOAST generates informative embeddings as features. Subsequently, the unsupervised clustering algorithm HDBSCAN is utilized to assign labels to each genomic bin, facilitating the identification of contiguous regions with the same label as TADs. Our experimental analysis of several simulated Hi-C data sets shows that TOAST can quickly and accurately identify TADs from different types of simulated Hi-C contact matrices, outperforming existing algorithms. We also determined the anchoring ratio of TAD boundaries by analyzing different TAD recognition algorithms, and obtained an average ratio of anchoring CTCF, SMC3, RAD21, POLR2A, H3K36me3, H3K9me3, H3K4me3, H3K4me1, Enhancer, and Promoters of 0.66, 0.47, 0.54, 0.27, 0.24, 0.12, 0.32, 0.41, 0.26, and 0.13, respectively. In conclusion, TOAST is a method that can quickly identify TAD boundary parameters that are easy to understand and have important biological significance. The TOAST web server can be accessed via http://223.223.185.189:4005/. The code of TOAST is available online at https://github.com/ghaiyan/TOAST. |
format | Online Article Text |
id | pubmed-10562672 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-105626722023-10-11 TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering() Gong, Haiyan Zhang, Dawei Zhang, Xiaotong Comput Struct Biotechnol J Method Article Topologically associated domains (TADs) play a pivotal role in disease detection. This study introduces a novel TADs recognition approach named TOAST, leveraging graph auto-encoders and clustering techniques. TOAST conceptualizes each genomic bin as a node of a graph and employs the Hi-C contact matrix as the graph's adjacency matrix. By employing graph auto-encoders, TOAST generates informative embeddings as features. Subsequently, the unsupervised clustering algorithm HDBSCAN is utilized to assign labels to each genomic bin, facilitating the identification of contiguous regions with the same label as TADs. Our experimental analysis of several simulated Hi-C data sets shows that TOAST can quickly and accurately identify TADs from different types of simulated Hi-C contact matrices, outperforming existing algorithms. We also determined the anchoring ratio of TAD boundaries by analyzing different TAD recognition algorithms, and obtained an average ratio of anchoring CTCF, SMC3, RAD21, POLR2A, H3K36me3, H3K9me3, H3K4me3, H3K4me1, Enhancer, and Promoters of 0.66, 0.47, 0.54, 0.27, 0.24, 0.12, 0.32, 0.41, 0.26, and 0.13, respectively. In conclusion, TOAST is a method that can quickly identify TAD boundary parameters that are easy to understand and have important biological significance. The TOAST web server can be accessed via http://223.223.185.189:4005/. The code of TOAST is available online at https://github.com/ghaiyan/TOAST. Research Network of Computational and Structural Biotechnology 2023-09-27 /pmc/articles/PMC10562672/ /pubmed/37822562 http://dx.doi.org/10.1016/j.csbj.2023.09.019 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Method Article Gong, Haiyan Zhang, Dawei Zhang, Xiaotong TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering() |
title | TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering() |
title_full | TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering() |
title_fullStr | TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering() |
title_full_unstemmed | TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering() |
title_short | TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering() |
title_sort | toast: a novel method for identifying topologically associated domains based on graph auto-encoders and clustering() |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562672/ https://www.ncbi.nlm.nih.gov/pubmed/37822562 http://dx.doi.org/10.1016/j.csbj.2023.09.019 |
work_keys_str_mv | AT gonghaiyan toastanovelmethodforidentifyingtopologicallyassociateddomainsbasedongraphautoencodersandclustering AT zhangdawei toastanovelmethodforidentifyingtopologicallyassociateddomainsbasedongraphautoencodersandclustering AT zhangxiaotong toastanovelmethodforidentifyingtopologicallyassociateddomainsbasedongraphautoencodersandclustering |