Cargando…

HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient

Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods for assessing Hi-C data reproducibility can produce misleading results because they ignore spatial features in Hi-C data, such as domain structure and distance dependence. We present HiCRep, a fra...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Tao, Zhang, Feipeng, Yardımcı, Galip Gürkan, Song, Fan, Hardison, Ross C., Noble, William Stafford, Yue, Feng, Li, Qunhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5668950/
https://www.ncbi.nlm.nih.gov/pubmed/28855260
http://dx.doi.org/10.1101/gr.220640.117
_version_ 1783275766643949568
author Yang, Tao
Zhang, Feipeng
Yardımcı, Galip Gürkan
Song, Fan
Hardison, Ross C.
Noble, William Stafford
Yue, Feng
Li, Qunhua
author_facet Yang, Tao
Zhang, Feipeng
Yardımcı, Galip Gürkan
Song, Fan
Hardison, Ross C.
Noble, William Stafford
Yue, Feng
Li, Qunhua
author_sort Yang, Tao
collection PubMed
description Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods for assessing Hi-C data reproducibility can produce misleading results because they ignore spatial features in Hi-C data, such as domain structure and distance dependence. We present HiCRep, a framework for assessing the reproducibility of Hi-C data that systematically accounts for these features. In particular, we introduce a novel similarity measure, the stratum adjusted correlation coefficient (SCC), for quantifying the similarity between Hi-C interaction matrices. Not only does it provide a statistically sound and reliable evaluation of reproducibility, SCC can also be used to quantify differences between Hi-C contact matrices and to determine the optimal sequencing depth for a desired resolution. The measure consistently shows higher accuracy than existing approaches in distinguishing subtle differences in reproducibility and depicting interrelationships of cell lineages. The proposed measure is straightforward to interpret and easy to compute, making it well-suited for providing standardized, interpretable, automatable, and scalable quality control. The freely available R package HiCRep implements our approach.
format Online
Article
Text
id pubmed-5668950
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-56689502018-05-01 HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient Yang, Tao Zhang, Feipeng Yardımcı, Galip Gürkan Song, Fan Hardison, Ross C. Noble, William Stafford Yue, Feng Li, Qunhua Genome Res Method Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods for assessing Hi-C data reproducibility can produce misleading results because they ignore spatial features in Hi-C data, such as domain structure and distance dependence. We present HiCRep, a framework for assessing the reproducibility of Hi-C data that systematically accounts for these features. In particular, we introduce a novel similarity measure, the stratum adjusted correlation coefficient (SCC), for quantifying the similarity between Hi-C interaction matrices. Not only does it provide a statistically sound and reliable evaluation of reproducibility, SCC can also be used to quantify differences between Hi-C contact matrices and to determine the optimal sequencing depth for a desired resolution. The measure consistently shows higher accuracy than existing approaches in distinguishing subtle differences in reproducibility and depicting interrelationships of cell lineages. The proposed measure is straightforward to interpret and easy to compute, making it well-suited for providing standardized, interpretable, automatable, and scalable quality control. The freely available R package HiCRep implements our approach. Cold Spring Harbor Laboratory Press 2017-11 /pmc/articles/PMC5668950/ /pubmed/28855260 http://dx.doi.org/10.1101/gr.220640.117 Text en © 2017 Yang et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Yang, Tao
Zhang, Feipeng
Yardımcı, Galip Gürkan
Song, Fan
Hardison, Ross C.
Noble, William Stafford
Yue, Feng
Li, Qunhua
HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient
title HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient
title_full HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient
title_fullStr HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient
title_full_unstemmed HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient
title_short HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient
title_sort hicrep: assessing the reproducibility of hi-c data using a stratum-adjusted correlation coefficient
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5668950/
https://www.ncbi.nlm.nih.gov/pubmed/28855260
http://dx.doi.org/10.1101/gr.220640.117
work_keys_str_mv AT yangtao hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient
AT zhangfeipeng hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient
AT yardımcıgalipgurkan hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient
AT songfan hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient
AT hardisonrossc hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient
AT noblewilliamstafford hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient
AT yuefeng hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient
AT liqunhua hicrepassessingthereproducibilityofhicdatausingastratumadjustedcorrelationcoefficient