
The most frequent short sequences in non-coding DNA

The purpose of this work is to determine the most frequent short sequences in non-coding DNA. They may play a role in maintaining the structure and function of eukaryotic chromosomes. We present a simple method for the detection and analysis of such sequences in several genomes, including Arabidopsi...


Bibliographic Details
Main Authors:	Subirana, Juan A., Messeguer, Xavier
Format:	Text
Language:	English
Published:	Oxford University Press 2010
Subjects:	Genomics
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2831315/ https://www.ncbi.nlm.nih.gov/pubmed/19966278 http://dx.doi.org/10.1093/nar/gkp1094

_version_	1782178236182036480
author	Subirana, Juan A. Messeguer, Xavier
author_facet	Subirana, Juan A. Messeguer, Xavier
author_sort	Subirana, Juan A.
collection	PubMed
description	The purpose of this work is to determine the most frequent short sequences in non-coding DNA. They may play a role in maintaining the structure and function of eukaryotic chromosomes. We present a simple method for the detection and analysis of such sequences in several genomes, including Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens. We also study two chromosomes of man and mouse with a length similar to the whole genomes of the other species. We provide a list of the most common sequences of 9–14 bases in each genome. As expected, they are present in human Alu sequences. Our programs may also give a graph and a list of their position in the genome. Detection of clusters is also possible. In most cases, these sequences contain few alternating regions. Their intrinsic structure and their influence on nucleosome formation are not known. In particular, we have found new features of short sequences in C. elegans, which are distributed in heterogeneous clusters. They appear as punctuation marks in the chromosomes. Such clusters are not found in either A. thaliana or D. melanogaster. We discuss the possibility that they play a role in centromere function and homolog recognition in meiosis.
format	Text
id	pubmed-2831315
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-2831315 2010-03-03 The most frequent short sequences in non-coding DNA Subirana, Juan A. Messeguer, Xavier Nucleic Acids Res Genomics Oxford University Press 2010-03 2009-12-04 /pmc/articles/PMC2831315/ /pubmed/19966278 http://dx.doi.org/10.1093/nar/gkp1094 Text en © The Author(s) 2009. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Genomics Subirana, Juan A. Messeguer, Xavier The most frequent short sequences in non-coding DNA
title	The most frequent short sequences in non-coding DNA
title_full	The most frequent short sequences in non-coding DNA
title_fullStr	The most frequent short sequences in non-coding DNA
title_full_unstemmed	The most frequent short sequences in non-coding DNA
title_short	The most frequent short sequences in non-coding DNA
title_sort	most frequent short sequences in non-coding dna
topic	Genomics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2831315/ https://www.ncbi.nlm.nih.gov/pubmed/19966278 http://dx.doi.org/10.1093/nar/gkp1094
work_keys_str_mv	AT subiranajuana themostfrequentshortsequencesinnoncodingdna AT messeguerxavier themostfrequentshortsequencesinnoncodingdna AT subiranajuana mostfrequentshortsequencesinnoncodingdna AT messeguerxavier mostfrequentshortsequencesinnoncodingdna
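
The description in this record mentions listing the most common sequences of 9–14 bases and reporting their positions in a genome. As a rough illustration of that general idea only, the sketch below counts overlapping k-mers in a single sequence string and locates one of them. It is not the authors' program: the function names `most_frequent_kmers` and `kmer_positions` are hypothetical, only one strand is scanned, and no attempt is made to separate coding from non-coding regions.

```python
from collections import Counter

VALID_BASES = set("ACGT")


def most_frequent_kmers(sequence, k, top=5):
    """Count overlapping k-mers on one strand and return the `top` most common.

    Hypothetical helper for illustration; windows containing anything other
    than A, C, G or T (for example N) are skipped.
    """
    seq = sequence.upper()
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= VALID_BASES
    )
    return counts.most_common(top)


def kmer_positions(sequence, kmer):
    """Return the 0-based start of every occurrence of `kmer`, overlaps included."""
    seq = sequence.upper()
    kmer = kmer.upper()
    positions = []
    start = seq.find(kmer)
    while start != -1:
        positions.append(start)
        # Advance by one base so overlapping occurrences are also found.
        start = seq.find(kmer, start + 1)
    return positions
```

For real chromosomes one would call `most_frequent_kmers(seq, k)` for each `k` from 9 to 14; the position list from `kmer_positions` is the kind of input a cluster search or a plot along the chromosome would start from. Holding every k-mer in a `Counter` is memory-hungry at genome scale, so this is meant only for small test sequences.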
