Cargando…
diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data
BACKGROUND: Chromatin conformation capture with high-throughput sequencing (Hi-C) is a technique that measures the in vivo intensity of interactions between all pairs of loci in the genome. Most conventional analyses of Hi-C data focus on the detection of statistically significant interactions. Howe...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539688/ https://www.ncbi.nlm.nih.gov/pubmed/26283514 http://dx.doi.org/10.1186/s12859-015-0683-0 |
_version_ | 1782386144640499712 |
---|---|
author | Lun, Aaron T.L. Smyth, Gordon K. |
author_facet | Lun, Aaron T.L. Smyth, Gordon K. |
author_sort | Lun, Aaron T.L. |
collection | PubMed |
description | BACKGROUND: Chromatin conformation capture with high-throughput sequencing (Hi-C) is a technique that measures the in vivo intensity of interactions between all pairs of loci in the genome. Most conventional analyses of Hi-C data focus on the detection of statistically significant interactions. However, an alternative strategy involves identifying significant changes in the interaction intensity (i.e., differential interactions) between two or more biological conditions. This is more statistically rigorous and may provide more biologically relevant results. RESULTS: Here, we present the diffHic software package for the detection of differential interactions from Hi-C data. diffHic provides methods for read pair alignment and processing, counting into bin pairs, filtering out low-abundance events and normalization of trended or CNV-driven biases. It uses the statistical framework of the edgeR package to model biological variability and to test for significant differences between conditions. Several options for the visualization of results are also included. The use of diffHic is demonstrated with real Hi-C data sets. Performance against existing methods is also evaluated with simulated data. CONCLUSIONS: On real data, diffHic is able to successfully detect interactions with significant differences in intensity between biological conditions. It also compares favourably to existing software tools on simulated data sets. These results suggest that diffHic is a viable approach for differential analyses of Hi-C data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0683-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4539688 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45396882015-08-19 diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data Lun, Aaron T.L. Smyth, Gordon K. BMC Bioinformatics Software BACKGROUND: Chromatin conformation capture with high-throughput sequencing (Hi-C) is a technique that measures the in vivo intensity of interactions between all pairs of loci in the genome. Most conventional analyses of Hi-C data focus on the detection of statistically significant interactions. However, an alternative strategy involves identifying significant changes in the interaction intensity (i.e., differential interactions) between two or more biological conditions. This is more statistically rigorous and may provide more biologically relevant results. RESULTS: Here, we present the diffHic software package for the detection of differential interactions from Hi-C data. diffHic provides methods for read pair alignment and processing, counting into bin pairs, filtering out low-abundance events and normalization of trended or CNV-driven biases. It uses the statistical framework of the edgeR package to model biological variability and to test for significant differences between conditions. Several options for the visualization of results are also included. The use of diffHic is demonstrated with real Hi-C data sets. Performance against existing methods is also evaluated with simulated data. CONCLUSIONS: On real data, diffHic is able to successfully detect interactions with significant differences in intensity between biological conditions. It also compares favourably to existing software tools on simulated data sets. These results suggest that diffHic is a viable approach for differential analyses of Hi-C data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0683-0) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-19 /pmc/articles/PMC4539688/ /pubmed/26283514 http://dx.doi.org/10.1186/s12859-015-0683-0 Text en © Lun and Smyth. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Lun, Aaron T.L. Smyth, Gordon K. diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data |
title | diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data |
title_full | diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data |
title_fullStr | diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data |
title_full_unstemmed | diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data |
title_short | diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data |
title_sort | diffhic: a bioconductor package to detect differential genomic interactions in hi-c data |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539688/ https://www.ncbi.nlm.nih.gov/pubmed/26283514 http://dx.doi.org/10.1186/s12859-015-0683-0 |
work_keys_str_mv | AT lunaarontl diffhicabioconductorpackagetodetectdifferentialgenomicinteractionsinhicdata AT smythgordonk diffhicabioconductorpackagetodetectdifferentialgenomicinteractionsinhicdata |