Cargando…

Amplification-free library preparation with SAFE Hi-C uses ligation products for deep sequencing to improve traditional Hi-C analysis

PCR amplification of Hi-C libraries introduces unusable duplicates and results in a biased representation of chromatin interactions. We present a simplified, fast, and economically efficient Hi-C library preparation procedure, SAFE Hi-C, which generates sufficient non-amplified ligation products for...

Descripción completa

Detalles Bibliográficos
Autores principales: Niu, Longjian, Shen, Wei, Huang, Yingzhang, He, Na, Zhang, Yuedong, Sun, Jialei, Wan, Jing, Jiang, Daxin, Yang, Manyun, Tse, Yu Chung, Li, Li, Hou, Chunhui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6642088/
https://www.ncbi.nlm.nih.gov/pubmed/31341966
http://dx.doi.org/10.1038/s42003-019-0519-y
Descripción
Sumario:PCR amplification of Hi-C libraries introduces unusable duplicates and results in a biased representation of chromatin interactions. We present a simplified, fast, and economically efficient Hi-C library preparation procedure, SAFE Hi-C, which generates sufficient non-amplified ligation products for deep sequencing from 30 million Drosophila cells. Comprehensive analysis of the resulting data shows that amplification-free Hi-C preserves higher complexity of chromatin interaction and lowers sequencing depth for the same number of unique paired reads. For human cells which have a large genome, SAFE Hi-C recovers enough ligated fragments for direct high-throughput sequencing without amplification from as few as 250,000 cells. Comparison with published in situ Hi-C data from millions of human cells demonstrates that amplification introduces distance-dependent amplification bias, which results in an increased background noise level against genomic distance. With amplification bias avoided, SAFE Hi-C may produce a chromatin interaction network more faithfully reflecting the real three-dimensional genomic architecture.