Cargando…
HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient
Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods for assessing Hi-C data reproducibility can produce misleading results because they ignore spatial features in Hi-C data, such as domain structure and distance dependence. We present HiCRep, a fra...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5668950/ https://www.ncbi.nlm.nih.gov/pubmed/28855260 http://dx.doi.org/10.1101/gr.220640.117 |
_version_ | 1783275766643949568 |
---|---|
author | Yang, Tao Zhang, Feipeng Yardımcı, Galip Gürkan Song, Fan Hardison, Ross C. Noble, William Stafford Yue, Feng Li, Qunhua |
author_facet | Yang, Tao Zhang, Feipeng Yardımcı, Galip Gürkan Song, Fan Hardison, Ross C. Noble, William Stafford Yue, Feng Li, Qunhua |
author_sort | Yang, Tao |
collection | PubMed |
description | Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods for assessing Hi-C data reproducibility can produce misleading results because they ignore spatial features in Hi-C data, such as domain structure and distance dependence. We present HiCRep, a framework for assessing the reproducibility of Hi-C data that systematically accounts for these features. In particular, we introduce a novel similarity measure, the stratum adjusted correlation coefficient (SCC), for quantifying the similarity between Hi-C interaction matrices. Not only does it provide a statistically sound and reliable evaluation of reproducibility, SCC can also be used to quantify differences between Hi-C contact matrices and to determine the optimal sequencing depth for a desired resolution. The measure consistently shows higher accuracy than existing approaches in distinguishing subtle differences in reproducibility and depicting interrelationships of cell lineages. The proposed measure is straightforward to interpret and easy to compute, making it well-suited for providing standardized, interpretable, automatable, and scalable quality control. The freely available R package HiCRep implements our approach. |
format | Online Article Text |
id | pubmed-5668950 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-56689502018-05-01 HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient Yang, Tao Zhang, Feipeng Yardımcı, Galip Gürkan Song, Fan Hardison, Ross C. Noble, William Stafford Yue, Feng Li, Qunhua Genome Res Method Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods for assessing Hi-C data reproducibility can produce misleading results because they ignore spatial features in Hi-C data, such as domain structure and distance dependence. We present HiCRep, a framework for assessing the reproducibility of Hi-C data that systematically accounts for these features. In particular, we introduce a novel similarity measure, the stratum adjusted correlation coefficient (SCC), for quantifying the similarity between Hi-C interaction matrices. Not only does it provide a statistically sound and reliable evaluation of reproducibility, SCC can also be used to quantify differences between Hi-C contact matrices and to determine the optimal sequencing depth for a desired resolution. The measure consistently shows higher accuracy than existing approaches in distinguishing subtle differences in reproducibility and depicting interrelationships of cell lineages. The proposed measure is straightforward to interpret and easy to compute, making it well-suited for providing standardized, interpretable, automatable, and scalable quality control. The freely available R package HiCRep implements our approach. Cold Spring Harbor Laboratory Press 2017-11 /pmc/articles/PMC5668950/ /pubmed/28855260 http://dx.doi.org/10.1101/gr.220640.117 Text en © 2017 Yang et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Method Yang, Tao Zhang, Feipeng Yardımcı, Galip Gürkan Song, Fan Hardison, Ross C. Noble, William Stafford Yue, Feng Li, Qunhua HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient |
title | HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient |
title_full | HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient |
title_fullStr | HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient |
title_full_unstemmed | HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient |
title_short | HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient |
title_sort | hicrep: assessing the reproducibility of hi-c data using a stratum-adjusted correlation coefficient |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5668950/ https://www.ncbi.nlm.nih.gov/pubmed/28855260 http://dx.doi.org/10.1101/gr.220640.117 |
work_keys_str_mv | AT yangtao hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient AT zhangfeipeng hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient AT yardımcıgalipgurkan hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient AT songfan hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient AT hardisonrossc hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient AT noblewilliamstafford hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient AT yuefeng hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient AT liqunhua hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient |