Cargando…

How repetitive are genomes?

BACKGROUND: Genome sequences vary strongly in their repetitiveness and the causes for this are still debated. Here we propose a novel measure of genome repetitiveness, the index of repetitiveness, I(r), which can be computed in time proportional to the length of the sequences analyzed. We apply it t...

Descripción completa

Detalles Bibliográficos
Autores principales:	Haubold, Bernhard, Wiehe, Thomas
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2006
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1769404/ https://www.ncbi.nlm.nih.gov/pubmed/17187668 http://dx.doi.org/10.1186/1471-2105-7-541

_version_	1782131687960870912
author	Haubold, Bernhard Wiehe, Thomas
author_facet	Haubold, Bernhard Wiehe, Thomas
author_sort	Haubold, Bernhard
collection	PubMed
description	BACKGROUND: Genome sequences vary strongly in their repetitiveness and the causes for this are still debated. Here we propose a novel measure of genome repetitiveness, the index of repetitiveness, I(r), which can be computed in time proportional to the length of the sequences analyzed. We apply it to 336 genomes from all three domains of life. RESULTS: The expected value of I(r )is zero for random sequences of any G/C content and greater than zero for sequences with excess repeats. We find that the I(r )of archaea is significantly smaller than that of eubacteria, which in turn is smaller than that of eukaryotes. Mouse chromosomes have a significantly higher I(r )than human chromosomes and within each genome the Y chromosome is most repetitive. A sliding window analysis reveals that the human HOXA cluster and two surrounding genes are characterized by local minima in I(r). A program for calculating the I(r )is freely available at . CONCLUSION: The general measure of DNA repetitiveness proposed in this paper can be efficiently computed on a genomic scale. This reveals a broad spectrum of repetitiveness among diverse genomes which agrees qualitatively with previous studies of repeat content. A sliding window analysis helps to analyze the intragenomic distribution of repeats.
format	Text
id	pubmed-1769404
institution	National Center for Biotechnology Information
language	English
publishDate	2006
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-17694042007-01-16 How repetitive are genomes? Haubold, Bernhard Wiehe, Thomas BMC Bioinformatics Research Article BACKGROUND: Genome sequences vary strongly in their repetitiveness and the causes for this are still debated. Here we propose a novel measure of genome repetitiveness, the index of repetitiveness, I(r), which can be computed in time proportional to the length of the sequences analyzed. We apply it to 336 genomes from all three domains of life. RESULTS: The expected value of I(r )is zero for random sequences of any G/C content and greater than zero for sequences with excess repeats. We find that the I(r )of archaea is significantly smaller than that of eubacteria, which in turn is smaller than that of eukaryotes. Mouse chromosomes have a significantly higher I(r )than human chromosomes and within each genome the Y chromosome is most repetitive. A sliding window analysis reveals that the human HOXA cluster and two surrounding genes are characterized by local minima in I(r). A program for calculating the I(r )is freely available at . CONCLUSION: The general measure of DNA repetitiveness proposed in this paper can be efficiently computed on a genomic scale. This reveals a broad spectrum of repetitiveness among diverse genomes which agrees qualitatively with previous studies of repeat content. A sliding window analysis helps to analyze the intragenomic distribution of repeats. BioMed Central 2006-12-22 /pmc/articles/PMC1769404/ /pubmed/17187668 http://dx.doi.org/10.1186/1471-2105-7-541 Text en Copyright © 2006 Haubold and Wiehe; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Haubold, Bernhard Wiehe, Thomas How repetitive are genomes?
title	How repetitive are genomes?
title_full	How repetitive are genomes?
title_fullStr	How repetitive are genomes?
title_full_unstemmed	How repetitive are genomes?
title_short	How repetitive are genomes?
title_sort	how repetitive are genomes?
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1769404/ https://www.ncbi.nlm.nih.gov/pubmed/17187668 http://dx.doi.org/10.1186/1471-2105-7-541
work_keys_str_mv	AT hauboldbernhard howrepetitivearegenomes AT wiehethomas howrepetitivearegenomes

How repetitive are genomes?

Ejemplares similares