Cargando…

Implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions

BACKGROUND: In an earlier study, we hypothesized that genomic segments with different sequence organization patterns (OPs) might display functional specificity despite their similar GC content. Here we tested this hypothesis by dividing the human genome into 100 kb segments, classifying these segmen...

Descripción completa

Detalles Bibliográficos
Autores principales: Paz, Arnon, Frenkel, Svetlana, Snir, Sagi, Kirzhner, Valery, Korol, Abraham B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234528/
https://www.ncbi.nlm.nih.gov/pubmed/24684786
http://dx.doi.org/10.1186/1471-2164-15-252
_version_ 1782344880009248768
author Paz, Arnon
Frenkel, Svetlana
Snir, Sagi
Kirzhner, Valery
Korol, Abraham B
author_facet Paz, Arnon
Frenkel, Svetlana
Snir, Sagi
Kirzhner, Valery
Korol, Abraham B
author_sort Paz, Arnon
collection PubMed
description BACKGROUND: In an earlier study, we hypothesized that genomic segments with different sequence organization patterns (OPs) might display functional specificity despite their similar GC content. Here we tested this hypothesis by dividing the human genome into 100 kb segments, classifying these segments into five compositional groups according to GC content, and then characterizing each segment within the five groups by oligonucleotide counting (k-mer analysis; also referred to as compositional spectrum analysis, or CSA), to examine the distribution of sequence OPs in the segments. We performed the CSA on the entire DNA, i.e., its coding and non-coding parts the latter being much more abundant in the genome than the former. RESULTS: We identified 38 OP-type clusters of segments that differ in their compositional spectrum (CS) organization. Many of the segments that shared the same OP type were enriched with genes related to the same biological processes (developmental, signaling, etc.), components of biochemical complexes, or organelles. Thirteen OP-type clusters showed significant enrichment in genes connected to specific gene-ontology terms. Some of these clusters seemed to reflect certain events during periods of horizontal gene transfer and genome expansion, and subsequent evolution of genomic regions requiring coordinated regulation. CONCLUSIONS: There may be a tendency for genes that are involved in the same biological process, complex or organelle to use the same OP, even at a distance of ~ 100 kb from the genes. Although the intergenic DNA is non-coding, the general pattern of sequence organization (e.g., reflected in over-represented oligonucleotide “words”) may be important and were protected, to some extent, in the course of evolution.
format Online
Article
Text
id pubmed-4234528
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42345282014-11-18 Implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions Paz, Arnon Frenkel, Svetlana Snir, Sagi Kirzhner, Valery Korol, Abraham B BMC Genomics Research Article BACKGROUND: In an earlier study, we hypothesized that genomic segments with different sequence organization patterns (OPs) might display functional specificity despite their similar GC content. Here we tested this hypothesis by dividing the human genome into 100 kb segments, classifying these segments into five compositional groups according to GC content, and then characterizing each segment within the five groups by oligonucleotide counting (k-mer analysis; also referred to as compositional spectrum analysis, or CSA), to examine the distribution of sequence OPs in the segments. We performed the CSA on the entire DNA, i.e., its coding and non-coding parts the latter being much more abundant in the genome than the former. RESULTS: We identified 38 OP-type clusters of segments that differ in their compositional spectrum (CS) organization. Many of the segments that shared the same OP type were enriched with genes related to the same biological processes (developmental, signaling, etc.), components of biochemical complexes, or organelles. Thirteen OP-type clusters showed significant enrichment in genes connected to specific gene-ontology terms. Some of these clusters seemed to reflect certain events during periods of horizontal gene transfer and genome expansion, and subsequent evolution of genomic regions requiring coordinated regulation. CONCLUSIONS: There may be a tendency for genes that are involved in the same biological process, complex or organelle to use the same OP, even at a distance of ~ 100 kb from the genes. Although the intergenic DNA is non-coding, the general pattern of sequence organization (e.g., reflected in over-represented oligonucleotide “words”) may be important and were protected, to some extent, in the course of evolution. BioMed Central 2014-03-31 /pmc/articles/PMC4234528/ /pubmed/24684786 http://dx.doi.org/10.1186/1471-2164-15-252 Text en Copyright © 2014 Paz et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research Article
Paz, Arnon
Frenkel, Svetlana
Snir, Sagi
Kirzhner, Valery
Korol, Abraham B
Implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions
title Implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions
title_full Implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions
title_fullStr Implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions
title_full_unstemmed Implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions
title_short Implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions
title_sort implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234528/
https://www.ncbi.nlm.nih.gov/pubmed/24684786
http://dx.doi.org/10.1186/1471-2164-15-252
work_keys_str_mv AT pazarnon implicationsofhumangenomestructuralheterogeneityfunctionallyrelatedgenestendtoresideinorganizationallysimilargenomicregions
AT frenkelsvetlana implicationsofhumangenomestructuralheterogeneityfunctionallyrelatedgenestendtoresideinorganizationallysimilargenomicregions
AT snirsagi implicationsofhumangenomestructuralheterogeneityfunctionallyrelatedgenestendtoresideinorganizationallysimilargenomicregions
AT kirzhnervalery implicationsofhumangenomestructuralheterogeneityfunctionallyrelatedgenestendtoresideinorganizationallysimilargenomicregions
AT korolabrahamb implicationsofhumangenomestructuralheterogeneityfunctionallyrelatedgenestendtoresideinorganizationallysimilargenomicregions