Cargando…

diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data

BACKGROUND: Chromatin conformation capture with high-throughput sequencing (Hi-C) is a technique that measures the in vivo intensity of interactions between all pairs of loci in the genome. Most conventional analyses of Hi-C data focus on the detection of statistically significant interactions. Howe...

Descripción completa

Detalles Bibliográficos
Autores principales: Lun, Aaron T.L., Smyth, Gordon K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539688/
https://www.ncbi.nlm.nih.gov/pubmed/26283514
http://dx.doi.org/10.1186/s12859-015-0683-0
_version_ 1782386144640499712
author Lun, Aaron T.L.
Smyth, Gordon K.
author_facet Lun, Aaron T.L.
Smyth, Gordon K.
author_sort Lun, Aaron T.L.
collection PubMed
description BACKGROUND: Chromatin conformation capture with high-throughput sequencing (Hi-C) is a technique that measures the in vivo intensity of interactions between all pairs of loci in the genome. Most conventional analyses of Hi-C data focus on the detection of statistically significant interactions. However, an alternative strategy involves identifying significant changes in the interaction intensity (i.e., differential interactions) between two or more biological conditions. This is more statistically rigorous and may provide more biologically relevant results. RESULTS: Here, we present the diffHic software package for the detection of differential interactions from Hi-C data. diffHic provides methods for read pair alignment and processing, counting into bin pairs, filtering out low-abundance events and normalization of trended or CNV-driven biases. It uses the statistical framework of the edgeR package to model biological variability and to test for significant differences between conditions. Several options for the visualization of results are also included. The use of diffHic is demonstrated with real Hi-C data sets. Performance against existing methods is also evaluated with simulated data. CONCLUSIONS: On real data, diffHic is able to successfully detect interactions with significant differences in intensity between biological conditions. It also compares favourably to existing software tools on simulated data sets. These results suggest that diffHic is a viable approach for differential analyses of Hi-C data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0683-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4539688
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45396882015-08-19 diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data Lun, Aaron T.L. Smyth, Gordon K. BMC Bioinformatics Software BACKGROUND: Chromatin conformation capture with high-throughput sequencing (Hi-C) is a technique that measures the in vivo intensity of interactions between all pairs of loci in the genome. Most conventional analyses of Hi-C data focus on the detection of statistically significant interactions. However, an alternative strategy involves identifying significant changes in the interaction intensity (i.e., differential interactions) between two or more biological conditions. This is more statistically rigorous and may provide more biologically relevant results. RESULTS: Here, we present the diffHic software package for the detection of differential interactions from Hi-C data. diffHic provides methods for read pair alignment and processing, counting into bin pairs, filtering out low-abundance events and normalization of trended or CNV-driven biases. It uses the statistical framework of the edgeR package to model biological variability and to test for significant differences between conditions. Several options for the visualization of results are also included. The use of diffHic is demonstrated with real Hi-C data sets. Performance against existing methods is also evaluated with simulated data. CONCLUSIONS: On real data, diffHic is able to successfully detect interactions with significant differences in intensity between biological conditions. It also compares favourably to existing software tools on simulated data sets. These results suggest that diffHic is a viable approach for differential analyses of Hi-C data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0683-0) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-19 /pmc/articles/PMC4539688/ /pubmed/26283514 http://dx.doi.org/10.1186/s12859-015-0683-0 Text en © Lun and Smyth. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Lun, Aaron T.L.
Smyth, Gordon K.
diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data
title diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data
title_full diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data
title_fullStr diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data
title_full_unstemmed diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data
title_short diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data
title_sort diffhic: a bioconductor package to detect differential genomic interactions in hi-c data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539688/
https://www.ncbi.nlm.nih.gov/pubmed/26283514
http://dx.doi.org/10.1186/s12859-015-0683-0
work_keys_str_mv AT lunaarontl diffhicabioconductorpackagetodetectdifferentialgenomicinteractionsinhicdata
AT smythgordonk diffhicabioconductorpackagetodetectdifferentialgenomicinteractionsinhicdata