Cargando…

HiC4D: forecasting spatiotemporal Hi-C data with residual ConvLSTM

The Hi-C experiments have been extensively used for the studies of genomic structures. In the last few years, spatiotemporal Hi-C has largely contributed to the investigation of genome dynamic reorganization. However, computationally modeling and forecasting spatiotemporal Hi-C data still have not b...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Tong, Wang, Zheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516390/
https://www.ncbi.nlm.nih.gov/pubmed/37478379
http://dx.doi.org/10.1093/bib/bbad263
_version_ 1785109120101122048
author Liu, Tong
Wang, Zheng
author_facet Liu, Tong
Wang, Zheng
author_sort Liu, Tong
collection PubMed
description The Hi-C experiments have been extensively used for the studies of genomic structures. In the last few years, spatiotemporal Hi-C has largely contributed to the investigation of genome dynamic reorganization. However, computationally modeling and forecasting spatiotemporal Hi-C data still have not been seen in the literature. We present HiC4D for dealing with the problem of forecasting spatiotemporal Hi-C data. We designed and benchmarked a novel network and named it residual ConvLSTM (ResConvLSTM), which is a combination of residual network and convolutional long short-term memory (ConvLSTM). We evaluated our new ResConvLSTM networks and compared them with the other five methods, including a naïve network (NaiveNet) that we designed as a baseline method and four outstanding video-prediction methods from the literature: ConvLSTM, spatiotemporal LSTM (ST-LSTM), self-attention LSTM (SA-LSTM) and simple video prediction (SimVP). We used eight different spatiotemporal Hi-C datasets for the blind test, including two from mouse embryogenesis, one from somatic cell nuclear transfer (SCNT) embryos, three embryogenesis datasets from different species and two non-embryogenesis datasets. Our evaluation results indicate that our ResConvLSTM networks almost always outperform the other methods on the eight blind-test datasets in terms of accurately predicting the Hi-C contact matrices at future time-steps. Our benchmarks also indicate that all of the methods that we benchmarked can successfully recover the boundaries of topologically associating domains called on the experimental Hi-C contact matrices. Taken together, our benchmarks suggest that HiC4D is an effective tool for predicting spatiotemporal Hi-C data. HiC4D is publicly available at both http://dna.cs.miami.edu/HiC4D/ and https://github.com/zwang-bioinformatics/HiC4D/.
format Online
Article
Text
id pubmed-10516390
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105163902023-09-23 HiC4D: forecasting spatiotemporal Hi-C data with residual ConvLSTM Liu, Tong Wang, Zheng Brief Bioinform Problem Solving Protocol The Hi-C experiments have been extensively used for the studies of genomic structures. In the last few years, spatiotemporal Hi-C has largely contributed to the investigation of genome dynamic reorganization. However, computationally modeling and forecasting spatiotemporal Hi-C data still have not been seen in the literature. We present HiC4D for dealing with the problem of forecasting spatiotemporal Hi-C data. We designed and benchmarked a novel network and named it residual ConvLSTM (ResConvLSTM), which is a combination of residual network and convolutional long short-term memory (ConvLSTM). We evaluated our new ResConvLSTM networks and compared them with the other five methods, including a naïve network (NaiveNet) that we designed as a baseline method and four outstanding video-prediction methods from the literature: ConvLSTM, spatiotemporal LSTM (ST-LSTM), self-attention LSTM (SA-LSTM) and simple video prediction (SimVP). We used eight different spatiotemporal Hi-C datasets for the blind test, including two from mouse embryogenesis, one from somatic cell nuclear transfer (SCNT) embryos, three embryogenesis datasets from different species and two non-embryogenesis datasets. Our evaluation results indicate that our ResConvLSTM networks almost always outperform the other methods on the eight blind-test datasets in terms of accurately predicting the Hi-C contact matrices at future time-steps. Our benchmarks also indicate that all of the methods that we benchmarked can successfully recover the boundaries of topologically associating domains called on the experimental Hi-C contact matrices. Taken together, our benchmarks suggest that HiC4D is an effective tool for predicting spatiotemporal Hi-C data. HiC4D is publicly available at both http://dna.cs.miami.edu/HiC4D/ and https://github.com/zwang-bioinformatics/HiC4D/. Oxford University Press 2023-07-20 /pmc/articles/PMC10516390/ /pubmed/37478379 http://dx.doi.org/10.1093/bib/bbad263 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Liu, Tong
Wang, Zheng
HiC4D: forecasting spatiotemporal Hi-C data with residual ConvLSTM
title HiC4D: forecasting spatiotemporal Hi-C data with residual ConvLSTM
title_full HiC4D: forecasting spatiotemporal Hi-C data with residual ConvLSTM
title_fullStr HiC4D: forecasting spatiotemporal Hi-C data with residual ConvLSTM
title_full_unstemmed HiC4D: forecasting spatiotemporal Hi-C data with residual ConvLSTM
title_short HiC4D: forecasting spatiotemporal Hi-C data with residual ConvLSTM
title_sort hic4d: forecasting spatiotemporal hi-c data with residual convlstm
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516390/
https://www.ncbi.nlm.nih.gov/pubmed/37478379
http://dx.doi.org/10.1093/bib/bbad263
work_keys_str_mv AT liutong hic4dforecastingspatiotemporalhicdatawithresidualconvlstm
AT wangzheng hic4dforecastingspatiotemporalhicdatawithresidualconvlstm