Cargando…

HiCNN: a very deep convolutional neural network to better enhance the resolution of Hi-C data

MOTIVATION: High-resolution Hi-C data are indispensable for the studies of three-dimensional (3D) genome organization at kilobase level. However, generating high-resolution Hi-C data (e.g. 5 kb) by conducting Hi-C experiments needs millions of mammalian cells, which may eventually generate billions...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Tong, Wang, Zheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821373/
https://www.ncbi.nlm.nih.gov/pubmed/31056636
http://dx.doi.org/10.1093/bioinformatics/btz251
_version_ 1783464130551742464
author Liu, Tong
Wang, Zheng
author_facet Liu, Tong
Wang, Zheng
author_sort Liu, Tong
collection PubMed
description MOTIVATION: High-resolution Hi-C data are indispensable for the studies of three-dimensional (3D) genome organization at kilobase level. However, generating high-resolution Hi-C data (e.g. 5 kb) by conducting Hi-C experiments needs millions of mammalian cells, which may eventually generate billions of paired-end reads with a high sequencing cost. Therefore, it will be important and helpful if we can enhance the resolutions of Hi-C data by computational methods. RESULTS: We developed a new computational method named HiCNN that used a 54-layer very deep convolutional neural network to enhance the resolutions of Hi-C data. The network contains both global and local residual learning with multiple speedup techniques included resulting in fast convergence. We used mean squared errors and Pearson’s correlation coefficients between real high-resolution and computationally predicted high-resolution Hi-C data to evaluate the method. The evaluation results show that HiCNN consistently outperforms HiCPlus, the only existing tool in the literature, when training and testing data are extracted from the same cell type (i.e. GM12878) and from two different cell types in the same or different species (i.e. GM12878 as training with K562 as testing, and GM12878 as training with CH12-LX as testing). We further found that the HiCNN-enhanced high-resolution Hi-C data are more consistent with real experimental high-resolution Hi-C data than HiCPlus-enhanced data in terms of indicating statistically significant interactions. Moreover, HiCNN can efficiently enhance low-resolution Hi-C data, which eventually helps recover two chromatin loops that were confirmed by 3D-FISH. AVAILABILITY AND IMPLEMENTATION: HiCNN is freely available at http://dna.cs.miami.edu/HiCNN/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6821373
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68213732019-11-04 HiCNN: a very deep convolutional neural network to better enhance the resolution of Hi-C data Liu, Tong Wang, Zheng Bioinformatics Original Papers MOTIVATION: High-resolution Hi-C data are indispensable for the studies of three-dimensional (3D) genome organization at kilobase level. However, generating high-resolution Hi-C data (e.g. 5 kb) by conducting Hi-C experiments needs millions of mammalian cells, which may eventually generate billions of paired-end reads with a high sequencing cost. Therefore, it will be important and helpful if we can enhance the resolutions of Hi-C data by computational methods. RESULTS: We developed a new computational method named HiCNN that used a 54-layer very deep convolutional neural network to enhance the resolutions of Hi-C data. The network contains both global and local residual learning with multiple speedup techniques included resulting in fast convergence. We used mean squared errors and Pearson’s correlation coefficients between real high-resolution and computationally predicted high-resolution Hi-C data to evaluate the method. The evaluation results show that HiCNN consistently outperforms HiCPlus, the only existing tool in the literature, when training and testing data are extracted from the same cell type (i.e. GM12878) and from two different cell types in the same or different species (i.e. GM12878 as training with K562 as testing, and GM12878 as training with CH12-LX as testing). We further found that the HiCNN-enhanced high-resolution Hi-C data are more consistent with real experimental high-resolution Hi-C data than HiCPlus-enhanced data in terms of indicating statistically significant interactions. Moreover, HiCNN can efficiently enhance low-resolution Hi-C data, which eventually helps recover two chromatin loops that were confirmed by 3D-FISH. AVAILABILITY AND IMPLEMENTATION: HiCNN is freely available at http://dna.cs.miami.edu/HiCNN/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-11-01 2019-04-09 /pmc/articles/PMC6821373/ /pubmed/31056636 http://dx.doi.org/10.1093/bioinformatics/btz251 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Liu, Tong
Wang, Zheng
HiCNN: a very deep convolutional neural network to better enhance the resolution of Hi-C data
title HiCNN: a very deep convolutional neural network to better enhance the resolution of Hi-C data
title_full HiCNN: a very deep convolutional neural network to better enhance the resolution of Hi-C data
title_fullStr HiCNN: a very deep convolutional neural network to better enhance the resolution of Hi-C data
title_full_unstemmed HiCNN: a very deep convolutional neural network to better enhance the resolution of Hi-C data
title_short HiCNN: a very deep convolutional neural network to better enhance the resolution of Hi-C data
title_sort hicnn: a very deep convolutional neural network to better enhance the resolution of hi-c data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821373/
https://www.ncbi.nlm.nih.gov/pubmed/31056636
http://dx.doi.org/10.1093/bioinformatics/btz251
work_keys_str_mv AT liutong hicnnaverydeepconvolutionalneuralnetworktobetterenhancetheresolutionofhicdata
AT wangzheng hicnnaverydeepconvolutionalneuralnetworktobetterenhancetheresolutionofhicdata