Cargando…

Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes

Oligonucleotide usage in archaeal and bacterial genomes can be linked to a number of properties, including codon usage (trinucleotides), DNA base-stacking energy (dinucleotides), and DNA structural conformation (di- to tetranucleotides). We wanted to assess the statistical information potential of d...

Descripción completa

Detalles Bibliográficos
Autores principales: Bohlin, Jon, Skjerve, Eystein, Ussery, David W.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2289840/
https://www.ncbi.nlm.nih.gov/pubmed/18421372
http://dx.doi.org/10.1371/journal.pcbi.1000057
_version_ 1782152346822770688
author Bohlin, Jon
Skjerve, Eystein
Ussery, David W.
author_facet Bohlin, Jon
Skjerve, Eystein
Ussery, David W.
author_sort Bohlin, Jon
collection PubMed
description Oligonucleotide usage in archaeal and bacterial genomes can be linked to a number of properties, including codon usage (trinucleotides), DNA base-stacking energy (dinucleotides), and DNA structural conformation (di- to tetranucleotides). We wanted to assess the statistical information potential of different DNA ‘word-sizes’ and explore how oligonucleotide frequencies differ in coding and non-coding regions. In addition, we used oligonucleotide frequencies to investigate DNA composition and how DNA sequence patterns change within and between prokaryotic organisms. Among the results found was that prokaryotic chromosomes can be described by hexanucleotide frequencies, suggesting that prokaryotic DNA is predominantly short range correlated, i.e., information in prokaryotic genomes is encoded in short oligonucleotides. Oligonucleotide usage varied more within AT-rich and host-associated genomes than in GC-rich and free-living genomes, and this variation was mainly located in non-coding regions. Bias (selectional pressure) in tetranucleotide usage correlated with GC content, and coding regions were more biased than non-coding regions. Non-coding regions were also found to be approximately 5.5% more AT-rich than coding regions, on average, in the 402 chromosomes examined. Pronounced DNA compositional differences were found both within and between AT-rich and GC-rich genomes. GC-rich genomes were more similar and biased in terms of tetranucleotide usage in non-coding regions than AT-rich genomes. The differences found between AT-rich and GC-rich genomes may possibly be attributed to lifestyle, since tetranucleotide usage within host-associated bacteria was, on average, more dissimilar and less biased than free-living archaea and bacteria.
format Text
id pubmed-2289840
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-22898402008-04-18 Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes Bohlin, Jon Skjerve, Eystein Ussery, David W. PLoS Comput Biol Research Article Oligonucleotide usage in archaeal and bacterial genomes can be linked to a number of properties, including codon usage (trinucleotides), DNA base-stacking energy (dinucleotides), and DNA structural conformation (di- to tetranucleotides). We wanted to assess the statistical information potential of different DNA ‘word-sizes’ and explore how oligonucleotide frequencies differ in coding and non-coding regions. In addition, we used oligonucleotide frequencies to investigate DNA composition and how DNA sequence patterns change within and between prokaryotic organisms. Among the results found was that prokaryotic chromosomes can be described by hexanucleotide frequencies, suggesting that prokaryotic DNA is predominantly short range correlated, i.e., information in prokaryotic genomes is encoded in short oligonucleotides. Oligonucleotide usage varied more within AT-rich and host-associated genomes than in GC-rich and free-living genomes, and this variation was mainly located in non-coding regions. Bias (selectional pressure) in tetranucleotide usage correlated with GC content, and coding regions were more biased than non-coding regions. Non-coding regions were also found to be approximately 5.5% more AT-rich than coding regions, on average, in the 402 chromosomes examined. Pronounced DNA compositional differences were found both within and between AT-rich and GC-rich genomes. GC-rich genomes were more similar and biased in terms of tetranucleotide usage in non-coding regions than AT-rich genomes. The differences found between AT-rich and GC-rich genomes may possibly be attributed to lifestyle, since tetranucleotide usage within host-associated bacteria was, on average, more dissimilar and less biased than free-living archaea and bacteria. Public Library of Science 2008-04-18 /pmc/articles/PMC2289840/ /pubmed/18421372 http://dx.doi.org/10.1371/journal.pcbi.1000057 Text en Bohlin et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Bohlin, Jon
Skjerve, Eystein
Ussery, David W.
Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes
title Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes
title_full Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes
title_fullStr Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes
title_full_unstemmed Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes
title_short Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes
title_sort investigations of oligonucleotide usage variance within and between prokaryotes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2289840/
https://www.ncbi.nlm.nih.gov/pubmed/18421372
http://dx.doi.org/10.1371/journal.pcbi.1000057
work_keys_str_mv AT bohlinjon investigationsofoligonucleotideusagevariancewithinandbetweenprokaryotes
AT skjerveeystein investigationsofoligonucleotideusagevariancewithinandbetweenprokaryotes
AT usserydavidw investigationsofoligonucleotideusagevariancewithinandbetweenprokaryotes