Cargando…
Unsupervised embedding of single-cell Hi-C data
MOTIVATION: Single-cell Hi-C (scHi-C) data promises to enable scientists to interrogate the 3D architecture of DNA in the nucleus of the cell, studying how this structure varies stochastically or along developmental or cell-cycle axes. However, Hi-C data analysis requires methods that take into acco...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022597/ https://www.ncbi.nlm.nih.gov/pubmed/29950005 http://dx.doi.org/10.1093/bioinformatics/bty285 |
_version_ | 1783335712248037376 |
---|---|
author | Liu, Jie Lin, Dejun Yardımcı, Galip Gürkan Noble, William Stafford |
author_facet | Liu, Jie Lin, Dejun Yardımcı, Galip Gürkan Noble, William Stafford |
author_sort | Liu, Jie |
collection | PubMed |
description | MOTIVATION: Single-cell Hi-C (scHi-C) data promises to enable scientists to interrogate the 3D architecture of DNA in the nucleus of the cell, studying how this structure varies stochastically or along developmental or cell-cycle axes. However, Hi-C data analysis requires methods that take into account the unique characteristics of this type of data. In this work, we explore whether methods that have been developed previously for the analysis of bulk Hi-C data can be applied to scHi-C data. We apply methods designed for analysis of bulk Hi-C data to scHi-C data in conjunction with unsupervised embedding. RESULTS: We find that one of these methods, HiCRep, when used in conjunction with multidimensional scaling (MDS), strongly outperforms three other methods, including a technique that has been used previously for scHi-C analysis. We also provide evidence that the HiCRep/MDS method is robust to extremely low per-cell sequencing depth, that this robustness is improved even further when high-coverage and low-coverage cells are projected together, and that the method can be used to jointly embed cells from multiple published datasets. |
format | Online Article Text |
id | pubmed-6022597 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-60225972018-07-10 Unsupervised embedding of single-cell Hi-C data Liu, Jie Lin, Dejun Yardımcı, Galip Gürkan Noble, William Stafford Bioinformatics Ismb 2018–Intelligent Systems for Molecular Biology Proceedings MOTIVATION: Single-cell Hi-C (scHi-C) data promises to enable scientists to interrogate the 3D architecture of DNA in the nucleus of the cell, studying how this structure varies stochastically or along developmental or cell-cycle axes. However, Hi-C data analysis requires methods that take into account the unique characteristics of this type of data. In this work, we explore whether methods that have been developed previously for the analysis of bulk Hi-C data can be applied to scHi-C data. We apply methods designed for analysis of bulk Hi-C data to scHi-C data in conjunction with unsupervised embedding. RESULTS: We find that one of these methods, HiCRep, when used in conjunction with multidimensional scaling (MDS), strongly outperforms three other methods, including a technique that has been used previously for scHi-C analysis. We also provide evidence that the HiCRep/MDS method is robust to extremely low per-cell sequencing depth, that this robustness is improved even further when high-coverage and low-coverage cells are projected together, and that the method can be used to jointly embed cells from multiple published datasets. Oxford University Press 2018-07-01 2018-06-27 /pmc/articles/PMC6022597/ /pubmed/29950005 http://dx.doi.org/10.1093/bioinformatics/bty285 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb 2018–Intelligent Systems for Molecular Biology Proceedings Liu, Jie Lin, Dejun Yardımcı, Galip Gürkan Noble, William Stafford Unsupervised embedding of single-cell Hi-C data |
title | Unsupervised embedding of single-cell Hi-C data |
title_full | Unsupervised embedding of single-cell Hi-C data |
title_fullStr | Unsupervised embedding of single-cell Hi-C data |
title_full_unstemmed | Unsupervised embedding of single-cell Hi-C data |
title_short | Unsupervised embedding of single-cell Hi-C data |
title_sort | unsupervised embedding of single-cell hi-c data |
topic | Ismb 2018–Intelligent Systems for Molecular Biology Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022597/ https://www.ncbi.nlm.nih.gov/pubmed/29950005 http://dx.doi.org/10.1093/bioinformatics/bty285 |
work_keys_str_mv | AT liujie unsupervisedembeddingofsinglecellhicdata AT lindejun unsupervisedembeddingofsinglecellhicdata AT yardımcıgalipgurkan unsupervisedembeddingofsinglecellhicdata AT noblewilliamstafford unsupervisedembeddingofsinglecellhicdata |