Cargando…

Reconstruction of 3D genome architecture via a two-stage algorithm

BACKGROUND: The three-dimensional (3D) configuration of chromosomes within the eukaryote nucleus is an important factor for several cellular functions, including gene expression regulation, and has also been linked with cancer-causing translocation events. While visualization of such architecture re...

Descripción completa

Detalles Bibliográficos
Autores principales: Segal, Mark R., Bengtsson, Henrik L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4638111/
https://www.ncbi.nlm.nih.gov/pubmed/26553003
http://dx.doi.org/10.1186/s12859-015-0799-2
_version_ 1782399882228662272
author Segal, Mark R.
Bengtsson, Henrik L.
author_facet Segal, Mark R.
Bengtsson, Henrik L.
author_sort Segal, Mark R.
collection PubMed
description BACKGROUND: The three-dimensional (3D) configuration of chromosomes within the eukaryote nucleus is an important factor for several cellular functions, including gene expression regulation, and has also been linked with cancer-causing translocation events. While visualization of such architecture remains limited to low resolutions, the ability to infer structures at increasing resolutions has been enabled by recently-devised chromosome conformation capture techniques. In particular, when coupled with next generation sequencing, such methods yield an inventory of genome-wide chromatin contacts or interactions. Various algorithms have been advanced to operate on such contact data to produce reconstructed 3D configurations. Studies have shown that these reconstructions can provide added value over raw interaction data with respect to downstream biological insights. However, only limited, low-resolution reconstructions have been realized for mammals due to computational bottlenecks. RESULTS: Here we propose a two-stage algorithm to partially overcome these computational barriers. The central idea is to initially utilize existing reconstruction techniques on an individual chromosome basis, using intra-chromosomal contacts, and then to relatively position these chromosome-level reconstructions using inter-chromosomal contacts. This two-stage strategy represents a natural approach in view of the within- versus between- chromosome distribution of contacts. It can increase resolution ≈ 20 fold for mouse and human. After describing the algorithm we present 3D architectures for mouse embryonic stem cells and human lymphoblastoid cells. We evaluate the impact of several factors on reconstruction reproducibility and explore a variety of sampling schemes. We further analyze replicate data at differing resolutions obtained from recently devised in situ Hi-C assays. In all instances we demonstrate insensitivity of the whole-genome 3D reconstruction obtained by the two-stage algorithm to the sampling strategy used. CONCLUSIONS: Our two-stage algorithm has the potential to significantly increase the resolution of 3D genome reconstructions. The improvements are such that we can progress from 1 Mb resolution to 100 kb resolution, notable since this latter value has been identified as critical to inferring topological domains in analyses performed on the contact (rather than 3D) level.
format Online
Article
Text
id pubmed-4638111
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46381112015-11-10 Reconstruction of 3D genome architecture via a two-stage algorithm Segal, Mark R. Bengtsson, Henrik L. BMC Bioinformatics Research Article BACKGROUND: The three-dimensional (3D) configuration of chromosomes within the eukaryote nucleus is an important factor for several cellular functions, including gene expression regulation, and has also been linked with cancer-causing translocation events. While visualization of such architecture remains limited to low resolutions, the ability to infer structures at increasing resolutions has been enabled by recently-devised chromosome conformation capture techniques. In particular, when coupled with next generation sequencing, such methods yield an inventory of genome-wide chromatin contacts or interactions. Various algorithms have been advanced to operate on such contact data to produce reconstructed 3D configurations. Studies have shown that these reconstructions can provide added value over raw interaction data with respect to downstream biological insights. However, only limited, low-resolution reconstructions have been realized for mammals due to computational bottlenecks. RESULTS: Here we propose a two-stage algorithm to partially overcome these computational barriers. The central idea is to initially utilize existing reconstruction techniques on an individual chromosome basis, using intra-chromosomal contacts, and then to relatively position these chromosome-level reconstructions using inter-chromosomal contacts. This two-stage strategy represents a natural approach in view of the within- versus between- chromosome distribution of contacts. It can increase resolution ≈ 20 fold for mouse and human. After describing the algorithm we present 3D architectures for mouse embryonic stem cells and human lymphoblastoid cells. We evaluate the impact of several factors on reconstruction reproducibility and explore a variety of sampling schemes. We further analyze replicate data at differing resolutions obtained from recently devised in situ Hi-C assays. In all instances we demonstrate insensitivity of the whole-genome 3D reconstruction obtained by the two-stage algorithm to the sampling strategy used. CONCLUSIONS: Our two-stage algorithm has the potential to significantly increase the resolution of 3D genome reconstructions. The improvements are such that we can progress from 1 Mb resolution to 100 kb resolution, notable since this latter value has been identified as critical to inferring topological domains in analyses performed on the contact (rather than 3D) level. BioMed Central 2015-11-09 /pmc/articles/PMC4638111/ /pubmed/26553003 http://dx.doi.org/10.1186/s12859-015-0799-2 Text en © Segal and Bengtsson. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Segal, Mark R.
Bengtsson, Henrik L.
Reconstruction of 3D genome architecture via a two-stage algorithm
title Reconstruction of 3D genome architecture via a two-stage algorithm
title_full Reconstruction of 3D genome architecture via a two-stage algorithm
title_fullStr Reconstruction of 3D genome architecture via a two-stage algorithm
title_full_unstemmed Reconstruction of 3D genome architecture via a two-stage algorithm
title_short Reconstruction of 3D genome architecture via a two-stage algorithm
title_sort reconstruction of 3d genome architecture via a two-stage algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4638111/
https://www.ncbi.nlm.nih.gov/pubmed/26553003
http://dx.doi.org/10.1186/s12859-015-0799-2
work_keys_str_mv AT segalmarkr reconstructionof3dgenomearchitectureviaatwostagealgorithm
AT bengtssonhenrikl reconstructionof3dgenomearchitectureviaatwostagealgorithm