Cargando…

Fine-structured multi-scaling long-range correlations in completely sequenced genomes—features, origin, and classification

The sequential organization of genomes, i.e. the relations between distant base pairs and regions within sequences, and its connection to the three-dimensional organization of genomes is still a largely unresolved problem. Long-range power-law correlations were found using correlation analysis on al...

Descripción completa

Detalles Bibliográficos
Autores principales: Knoch, Tobias A., Göker, Markus, Lohner, Rudolf, Abuseiris, Anis, Grosveld, Frank G.
Formato: Texto
Lenguaje:English
Publicado: Springer-Verlag 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2701493/
https://www.ncbi.nlm.nih.gov/pubmed/19533117
http://dx.doi.org/10.1007/s00249-009-0489-y
_version_ 1782168704205717504
author Knoch, Tobias A.
Göker, Markus
Lohner, Rudolf
Abuseiris, Anis
Grosveld, Frank G.
author_facet Knoch, Tobias A.
Göker, Markus
Lohner, Rudolf
Abuseiris, Anis
Grosveld, Frank G.
author_sort Knoch, Tobias A.
collection PubMed
description The sequential organization of genomes, i.e. the relations between distant base pairs and regions within sequences, and its connection to the three-dimensional organization of genomes is still a largely unresolved problem. Long-range power-law correlations were found using correlation analysis on almost the entire observable scale of 132 completely sequenced chromosomes of 0.5 × 10(6) to 3.0 × 10(7) bp from Archaea, Bacteria, Arabidopsis thaliana, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster, and Homo sapiens. The local correlation coefficients show a species-specific multi-scaling behaviour: close to random correlations on the scale of a few base pairs, a first maximum from 40 to 3,400 bp (for Arabidopsis thaliana and Drosophila melanogaster divided in two submaxima), and often a region of one or more second maxima from 10(5) to 3 × 10(5) bp. Within this multi-scaling behaviour, an additional fine-structure is present and attributable to codon usage in all except the human sequences, where it is related to nucleosomal binding. Computer-generated random sequences assuming a block organization of genomes, the codon usage, and nucleosomal binding explain these results. Mutation by sequence reshuffling destroyed all correlations. Thus, the stability of correlations seems to be evolutionarily tightly controlled and connected to the spatial genome organization, especially on large scales. In summary, genomes show a complex sequential organization related closely to their three-dimensional organization.
format Text
id pubmed-2701493
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Springer-Verlag
record_format MEDLINE/PubMed
spelling pubmed-27014932009-06-29 Fine-structured multi-scaling long-range correlations in completely sequenced genomes—features, origin, and classification Knoch, Tobias A. Göker, Markus Lohner, Rudolf Abuseiris, Anis Grosveld, Frank G. Eur Biophys J Original Paper The sequential organization of genomes, i.e. the relations between distant base pairs and regions within sequences, and its connection to the three-dimensional organization of genomes is still a largely unresolved problem. Long-range power-law correlations were found using correlation analysis on almost the entire observable scale of 132 completely sequenced chromosomes of 0.5 × 10(6) to 3.0 × 10(7) bp from Archaea, Bacteria, Arabidopsis thaliana, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster, and Homo sapiens. The local correlation coefficients show a species-specific multi-scaling behaviour: close to random correlations on the scale of a few base pairs, a first maximum from 40 to 3,400 bp (for Arabidopsis thaliana and Drosophila melanogaster divided in two submaxima), and often a region of one or more second maxima from 10(5) to 3 × 10(5) bp. Within this multi-scaling behaviour, an additional fine-structure is present and attributable to codon usage in all except the human sequences, where it is related to nucleosomal binding. Computer-generated random sequences assuming a block organization of genomes, the codon usage, and nucleosomal binding explain these results. Mutation by sequence reshuffling destroyed all correlations. Thus, the stability of correlations seems to be evolutionarily tightly controlled and connected to the spatial genome organization, especially on large scales. In summary, genomes show a complex sequential organization related closely to their three-dimensional organization. Springer-Verlag 2009-06-17 2009-07 /pmc/articles/PMC2701493/ /pubmed/19533117 http://dx.doi.org/10.1007/s00249-009-0489-y Text en © The Author(s) 2009
spellingShingle Original Paper
Knoch, Tobias A.
Göker, Markus
Lohner, Rudolf
Abuseiris, Anis
Grosveld, Frank G.
Fine-structured multi-scaling long-range correlations in completely sequenced genomes—features, origin, and classification
title Fine-structured multi-scaling long-range correlations in completely sequenced genomes—features, origin, and classification
title_full Fine-structured multi-scaling long-range correlations in completely sequenced genomes—features, origin, and classification
title_fullStr Fine-structured multi-scaling long-range correlations in completely sequenced genomes—features, origin, and classification
title_full_unstemmed Fine-structured multi-scaling long-range correlations in completely sequenced genomes—features, origin, and classification
title_short Fine-structured multi-scaling long-range correlations in completely sequenced genomes—features, origin, and classification
title_sort fine-structured multi-scaling long-range correlations in completely sequenced genomes—features, origin, and classification
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2701493/
https://www.ncbi.nlm.nih.gov/pubmed/19533117
http://dx.doi.org/10.1007/s00249-009-0489-y
work_keys_str_mv AT knochtobiasa finestructuredmultiscalinglongrangecorrelationsincompletelysequencedgenomesfeaturesoriginandclassification
AT gokermarkus finestructuredmultiscalinglongrangecorrelationsincompletelysequencedgenomesfeaturesoriginandclassification
AT lohnerrudolf finestructuredmultiscalinglongrangecorrelationsincompletelysequencedgenomesfeaturesoriginandclassification
AT abuseirisanis finestructuredmultiscalinglongrangecorrelationsincompletelysequencedgenomesfeaturesoriginandclassification
AT grosveldfrankg finestructuredmultiscalinglongrangecorrelationsincompletelysequencedgenomesfeaturesoriginandclassification