Cargando…

Unsupervised embedding of single-cell Hi-C data

MOTIVATION: Single-cell Hi-C (scHi-C) data promises to enable scientists to interrogate the 3D architecture of DNA in the nucleus of the cell, studying how this structure varies stochastically or along developmental or cell-cycle axes. However, Hi-C data analysis requires methods that take into acco...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jie, Lin, Dejun, Yardımcı, Galip Gürkan, Noble, William Stafford
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022597/
https://www.ncbi.nlm.nih.gov/pubmed/29950005
http://dx.doi.org/10.1093/bioinformatics/bty285
_version_ 1783335712248037376
author Liu, Jie
Lin, Dejun
Yardımcı, Galip Gürkan
Noble, William Stafford
author_facet Liu, Jie
Lin, Dejun
Yardımcı, Galip Gürkan
Noble, William Stafford
author_sort Liu, Jie
collection PubMed
description MOTIVATION: Single-cell Hi-C (scHi-C) data promises to enable scientists to interrogate the 3D architecture of DNA in the nucleus of the cell, studying how this structure varies stochastically or along developmental or cell-cycle axes. However, Hi-C data analysis requires methods that take into account the unique characteristics of this type of data. In this work, we explore whether methods that have been developed previously for the analysis of bulk Hi-C data can be applied to scHi-C data. We apply methods designed for analysis of bulk Hi-C data to scHi-C data in conjunction with unsupervised embedding. RESULTS: We find that one of these methods, HiCRep, when used in conjunction with multidimensional scaling (MDS), strongly outperforms three other methods, including a technique that has been used previously for scHi-C analysis. We also provide evidence that the HiCRep/MDS method is robust to extremely low per-cell sequencing depth, that this robustness is improved even further when high-coverage and low-coverage cells are projected together, and that the method can be used to jointly embed cells from multiple published datasets.
format Online
Article
Text
id pubmed-6022597
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60225972018-07-10 Unsupervised embedding of single-cell Hi-C data Liu, Jie Lin, Dejun Yardımcı, Galip Gürkan Noble, William Stafford Bioinformatics Ismb 2018–Intelligent Systems for Molecular Biology Proceedings MOTIVATION: Single-cell Hi-C (scHi-C) data promises to enable scientists to interrogate the 3D architecture of DNA in the nucleus of the cell, studying how this structure varies stochastically or along developmental or cell-cycle axes. However, Hi-C data analysis requires methods that take into account the unique characteristics of this type of data. In this work, we explore whether methods that have been developed previously for the analysis of bulk Hi-C data can be applied to scHi-C data. We apply methods designed for analysis of bulk Hi-C data to scHi-C data in conjunction with unsupervised embedding. RESULTS: We find that one of these methods, HiCRep, when used in conjunction with multidimensional scaling (MDS), strongly outperforms three other methods, including a technique that has been used previously for scHi-C analysis. We also provide evidence that the HiCRep/MDS method is robust to extremely low per-cell sequencing depth, that this robustness is improved even further when high-coverage and low-coverage cells are projected together, and that the method can be used to jointly embed cells from multiple published datasets. Oxford University Press 2018-07-01 2018-06-27 /pmc/articles/PMC6022597/ /pubmed/29950005 http://dx.doi.org/10.1093/bioinformatics/bty285 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2018–Intelligent Systems for Molecular Biology Proceedings
Liu, Jie
Lin, Dejun
Yardımcı, Galip Gürkan
Noble, William Stafford
Unsupervised embedding of single-cell Hi-C data
title Unsupervised embedding of single-cell Hi-C data
title_full Unsupervised embedding of single-cell Hi-C data
title_fullStr Unsupervised embedding of single-cell Hi-C data
title_full_unstemmed Unsupervised embedding of single-cell Hi-C data
title_short Unsupervised embedding of single-cell Hi-C data
title_sort unsupervised embedding of single-cell hi-c data
topic Ismb 2018–Intelligent Systems for Molecular Biology Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022597/
https://www.ncbi.nlm.nih.gov/pubmed/29950005
http://dx.doi.org/10.1093/bioinformatics/bty285
work_keys_str_mv AT liujie unsupervisedembeddingofsinglecellhicdata
AT lindejun unsupervisedembeddingofsinglecellhicdata
AT yardımcıgalipgurkan unsupervisedembeddingofsinglecellhicdata
AT noblewilliamstafford unsupervisedembeddingofsinglecellhicdata