Cargando…
Two-dimensional segmentation for analyzing Hi-C data
Motivation: The spatial conformation of the chromosome has a deep influence on gene regulation and expression. Hi-C technology allows the evaluation of the spatial proximity between any pair of loci along the genome. It results in a data matrix where blocks corresponding to (self-)interacting region...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147896/ https://www.ncbi.nlm.nih.gov/pubmed/25161224 http://dx.doi.org/10.1093/bioinformatics/btu443 |
_version_ | 1782332532343177216 |
---|---|
author | Lévy-Leduc, Celine Delattre, M. Mary-Huard, T. Robin, S. |
author_facet | Lévy-Leduc, Celine Delattre, M. Mary-Huard, T. Robin, S. |
author_sort | Lévy-Leduc, Celine |
collection | PubMed |
description | Motivation: The spatial conformation of the chromosome has a deep influence on gene regulation and expression. Hi-C technology allows the evaluation of the spatial proximity between any pair of loci along the genome. It results in a data matrix where blocks corresponding to (self-)interacting regions appear. The delimitation of such blocks is critical to better understand the spatial organization of the chromatin. From a computational point of view, it results in a 2D segmentation problem. Results: We focus on the detection of cis-interacting regions, which appear to be prominent in observed data. We define a block-wise segmentation model for the detection of such regions. We prove that the maximization of the likelihood with respect to the block boundaries can be rephrased in terms of a 1D segmentation problem, for which the standard dynamic programming applies. The performance of the proposed methods is assessed by a simulation study on both synthetic and resampled data. A comparative study on public data shows good concordance with biologically confirmed regions. Availability and implementation: The HiCseg R package is available from the Comprehensive R Archive Network and from the Web page of the corresponding author. Contact: celine.levy-leduc@agroparistech.fr |
format | Online Article Text |
id | pubmed-4147896 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-41478962014-09-02 Two-dimensional segmentation for analyzing Hi-C data Lévy-Leduc, Celine Delattre, M. Mary-Huard, T. Robin, S. Bioinformatics Eccb 2014 Proceedings Papers Committee Motivation: The spatial conformation of the chromosome has a deep influence on gene regulation and expression. Hi-C technology allows the evaluation of the spatial proximity between any pair of loci along the genome. It results in a data matrix where blocks corresponding to (self-)interacting regions appear. The delimitation of such blocks is critical to better understand the spatial organization of the chromatin. From a computational point of view, it results in a 2D segmentation problem. Results: We focus on the detection of cis-interacting regions, which appear to be prominent in observed data. We define a block-wise segmentation model for the detection of such regions. We prove that the maximization of the likelihood with respect to the block boundaries can be rephrased in terms of a 1D segmentation problem, for which the standard dynamic programming applies. The performance of the proposed methods is assessed by a simulation study on both synthetic and resampled data. A comparative study on public data shows good concordance with biologically confirmed regions. Availability and implementation: The HiCseg R package is available from the Comprehensive R Archive Network and from the Web page of the corresponding author. Contact: celine.levy-leduc@agroparistech.fr Oxford University Press 2014-09-01 2014-08-22 /pmc/articles/PMC4147896/ /pubmed/25161224 http://dx.doi.org/10.1093/bioinformatics/btu443 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Eccb 2014 Proceedings Papers Committee Lévy-Leduc, Celine Delattre, M. Mary-Huard, T. Robin, S. Two-dimensional segmentation for analyzing Hi-C data |
title | Two-dimensional segmentation for analyzing Hi-C data |
title_full | Two-dimensional segmentation for analyzing Hi-C data |
title_fullStr | Two-dimensional segmentation for analyzing Hi-C data |
title_full_unstemmed | Two-dimensional segmentation for analyzing Hi-C data |
title_short | Two-dimensional segmentation for analyzing Hi-C data |
title_sort | two-dimensional segmentation for analyzing hi-c data |
topic | Eccb 2014 Proceedings Papers Committee |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147896/ https://www.ncbi.nlm.nih.gov/pubmed/25161224 http://dx.doi.org/10.1093/bioinformatics/btu443 |
work_keys_str_mv | AT levyleducceline twodimensionalsegmentationforanalyzinghicdata AT delattrem twodimensionalsegmentationforanalyzinghicdata AT maryhuardt twodimensionalsegmentationforanalyzinghicdata AT robins twodimensionalsegmentationforanalyzinghicdata |