Cargando…

P-value based visualization of codon usage data

Two important and not yet solved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particular dimensionality reduction methods fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Meinicke, Peter, Brodag, Thomas, Fricke, Wolfgang Florian, Waack, Stephan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1526732/
https://www.ncbi.nlm.nih.gov/pubmed/16808834
http://dx.doi.org/10.1186/1748-7188-1-10
_version_ 1782128938818994176
author Meinicke, Peter
Brodag, Thomas
Fricke, Wolfgang Florian
Waack, Stephan
author_facet Meinicke, Peter
Brodag, Thomas
Fricke, Wolfgang Florian
Waack, Stephan
author_sort Meinicke, Peter
collection PubMed
description Two important and not yet solved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particular dimensionality reduction methods for visualization of multivariate data have shown to be effective tools for codon usage analysis. We here propose a multidimensional scaling approach using a novel similarity measure for codon usage tables. Our probabilistic similarity measure is based on P-values derived from the well-known chi-square test for comparison of two distributions. Experimental results on four microbial genomes indicate that the new method is well-suited for the analysis of horizontal gene transfer and translational selection. As compared with the widely-used correspondence analysis, our method did not suffer from outlier sensitivity and showed a better clustering of putative alien genes in most cases.
format Text
id pubmed-1526732
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15267322006-08-04 P-value based visualization of codon usage data Meinicke, Peter Brodag, Thomas Fricke, Wolfgang Florian Waack, Stephan Algorithms Mol Biol Research Two important and not yet solved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particular dimensionality reduction methods for visualization of multivariate data have shown to be effective tools for codon usage analysis. We here propose a multidimensional scaling approach using a novel similarity measure for codon usage tables. Our probabilistic similarity measure is based on P-values derived from the well-known chi-square test for comparison of two distributions. Experimental results on four microbial genomes indicate that the new method is well-suited for the analysis of horizontal gene transfer and translational selection. As compared with the widely-used correspondence analysis, our method did not suffer from outlier sensitivity and showed a better clustering of putative alien genes in most cases. BioMed Central 2006-06-29 /pmc/articles/PMC1526732/ /pubmed/16808834 http://dx.doi.org/10.1186/1748-7188-1-10 Text en Copyright © 2006 Meinicke et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Meinicke, Peter
Brodag, Thomas
Fricke, Wolfgang Florian
Waack, Stephan
P-value based visualization of codon usage data
title P-value based visualization of codon usage data
title_full P-value based visualization of codon usage data
title_fullStr P-value based visualization of codon usage data
title_full_unstemmed P-value based visualization of codon usage data
title_short P-value based visualization of codon usage data
title_sort p-value based visualization of codon usage data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1526732/
https://www.ncbi.nlm.nih.gov/pubmed/16808834
http://dx.doi.org/10.1186/1748-7188-1-10
work_keys_str_mv AT meinickepeter pvaluebasedvisualizationofcodonusagedata
AT brodagthomas pvaluebasedvisualizationofcodonusagedata
AT frickewolfgangflorian pvaluebasedvisualizationofcodonusagedata
AT waackstephan pvaluebasedvisualizationofcodonusagedata