Cargando…

Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity

BACKGROUND: There are a number of methods (also called: measures) currently in use that quantify codon usage in genes. These measures are often influenced by other sequence properties, such as length. This can introduce strong methodological bias into measurements; therefore we attempted to develop...

Descripción completa

Detalles Bibliográficos
Autores principales: Supek, Fran, Vlahoviček, Kristian
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1199580/
https://www.ncbi.nlm.nih.gov/pubmed/16029499
http://dx.doi.org/10.1186/1471-2105-6-182
_version_ 1782124871323484160
author Supek, Fran
Vlahoviček, Kristian
author_facet Supek, Fran
Vlahoviček, Kristian
author_sort Supek, Fran
collection PubMed
description BACKGROUND: There are a number of methods (also called: measures) currently in use that quantify codon usage in genes. These measures are often influenced by other sequence properties, such as length. This can introduce strong methodological bias into measurements; therefore we attempted to develop a method free from such dependencies. One of the common applications of codon usage analyses is to quantitatively predict gene expressivity. RESULTS: We compared the performance of several commonly used measures and a novel method we introduce in this paper – Measure Independent of Length and Composition (MILC). Large, randomly generated sequence sets were used to test for dependence on (i) sequence length, (ii) overall amount of codon bias and (iii) codon bias discrepancy in the sequences. A derivative of the method, named MELP (MILC-based Expression Level Predictor) can be used to quantitatively predict gene expression levels from genomic data. It was compared to other similar predictors by examining their correlation with actual, experimentally obtained mRNA or protein abundances. CONCLUSION: We have established that MILC is a generally applicable measure, being resistant to changes in gene length and overall nucleotide composition, and introducing little noise into measurements. Other methods, however, may also be appropriate in certain applications. Our efforts to quantitatively predict gene expression levels in several prokaryotes and unicellular eukaryotes met with varying levels of success, depending on the experimental dataset and predictor used. Out of all methods, MELP and Rainer Merkl's GCB method had the most consistent behaviour. A 'reference set' containing known ribosomal protein genes appears to be a valid starting point for a codon usage-based expressivity prediction.
format Text
id pubmed-1199580
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11995802005-09-08 Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity Supek, Fran Vlahoviček, Kristian BMC Bioinformatics Methodology Article BACKGROUND: There are a number of methods (also called: measures) currently in use that quantify codon usage in genes. These measures are often influenced by other sequence properties, such as length. This can introduce strong methodological bias into measurements; therefore we attempted to develop a method free from such dependencies. One of the common applications of codon usage analyses is to quantitatively predict gene expressivity. RESULTS: We compared the performance of several commonly used measures and a novel method we introduce in this paper – Measure Independent of Length and Composition (MILC). Large, randomly generated sequence sets were used to test for dependence on (i) sequence length, (ii) overall amount of codon bias and (iii) codon bias discrepancy in the sequences. A derivative of the method, named MELP (MILC-based Expression Level Predictor) can be used to quantitatively predict gene expression levels from genomic data. It was compared to other similar predictors by examining their correlation with actual, experimentally obtained mRNA or protein abundances. CONCLUSION: We have established that MILC is a generally applicable measure, being resistant to changes in gene length and overall nucleotide composition, and introducing little noise into measurements. Other methods, however, may also be appropriate in certain applications. Our efforts to quantitatively predict gene expression levels in several prokaryotes and unicellular eukaryotes met with varying levels of success, depending on the experimental dataset and predictor used. Out of all methods, MELP and Rainer Merkl's GCB method had the most consistent behaviour. A 'reference set' containing known ribosomal protein genes appears to be a valid starting point for a codon usage-based expressivity prediction. BioMed Central 2005-07-19 /pmc/articles/PMC1199580/ /pubmed/16029499 http://dx.doi.org/10.1186/1471-2105-6-182 Text en Copyright © 2005 Supek and Vlahoviček; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Supek, Fran
Vlahoviček, Kristian
Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity
title Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity
title_full Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity
title_fullStr Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity
title_full_unstemmed Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity
title_short Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity
title_sort comparison of codon usage measures and their applicability in prediction of microbial gene expressivity
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1199580/
https://www.ncbi.nlm.nih.gov/pubmed/16029499
http://dx.doi.org/10.1186/1471-2105-6-182
work_keys_str_mv AT supekfran comparisonofcodonusagemeasuresandtheirapplicabilityinpredictionofmicrobialgeneexpressivity
AT vlahovicekkristian comparisonofcodonusagemeasuresandtheirapplicabilityinpredictionofmicrobialgeneexpressivity