Cargando…

Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance

BACKGROUND: Genetic mutation, selective pressure for translational efficiency and accuracy, level of gene expression, and protein function through natural selection are all believed to lead to codon usage bias (CUB). Therefore, informative measurement of CUB is of fundamental importance to making in...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zhang, Li, Jun, Cui, Peng, Ding, Feng, Li, Ang, Townsend, Jeffrey P, Yu, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3368730/
https://www.ncbi.nlm.nih.gov/pubmed/22435713
http://dx.doi.org/10.1186/1471-2105-13-43
_version_ 1782234977806581760
author Zhang, Zhang
Li, Jun
Cui, Peng
Ding, Feng
Li, Ang
Townsend, Jeffrey P
Yu, Jun
author_facet Zhang, Zhang
Li, Jun
Cui, Peng
Ding, Feng
Li, Ang
Townsend, Jeffrey P
Yu, Jun
author_sort Zhang, Zhang
collection PubMed
description BACKGROUND: Genetic mutation, selective pressure for translational efficiency and accuracy, level of gene expression, and protein function through natural selection are all believed to lead to codon usage bias (CUB). Therefore, informative measurement of CUB is of fundamental importance to making inferences regarding gene function and genome evolution. However, extant measures of CUB have not fully accounted for the quantitative effect of background nucleotide composition and have not statistically evaluated the significance of CUB in sequence analysis. RESULTS: Here we propose a novel measure--Codon Deviation Coefficient (CDC)--that provides an informative measurement of CUB and its statistical significance without requiring any prior knowledge. Unlike previous measures, CDC estimates CUB by accounting for background nucleotide compositions tailored to codon positions and adopts the bootstrapping to assess the statistical significance of CUB for any given sequence. We evaluate CDC by examining its effectiveness on simulated sequences and empirical data and show that CDC outperforms extant measures by achieving a more informative estimation of CUB and its statistical significance. CONCLUSIONS: As validated by both simulated and empirical data, CDC provides a highly informative quantification of CUB and its statistical significance, useful for determining comparative magnitudes and patterns of biased codon usage for genes or genomes with diverse sequence compositions.
format Online
Article
Text
id pubmed-3368730
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33687302012-06-07 Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance Zhang, Zhang Li, Jun Cui, Peng Ding, Feng Li, Ang Townsend, Jeffrey P Yu, Jun BMC Bioinformatics Methodology Article BACKGROUND: Genetic mutation, selective pressure for translational efficiency and accuracy, level of gene expression, and protein function through natural selection are all believed to lead to codon usage bias (CUB). Therefore, informative measurement of CUB is of fundamental importance to making inferences regarding gene function and genome evolution. However, extant measures of CUB have not fully accounted for the quantitative effect of background nucleotide composition and have not statistically evaluated the significance of CUB in sequence analysis. RESULTS: Here we propose a novel measure--Codon Deviation Coefficient (CDC)--that provides an informative measurement of CUB and its statistical significance without requiring any prior knowledge. Unlike previous measures, CDC estimates CUB by accounting for background nucleotide compositions tailored to codon positions and adopts the bootstrapping to assess the statistical significance of CUB for any given sequence. We evaluate CDC by examining its effectiveness on simulated sequences and empirical data and show that CDC outperforms extant measures by achieving a more informative estimation of CUB and its statistical significance. CONCLUSIONS: As validated by both simulated and empirical data, CDC provides a highly informative quantification of CUB and its statistical significance, useful for determining comparative magnitudes and patterns of biased codon usage for genes or genomes with diverse sequence compositions. BioMed Central 2012-03-22 /pmc/articles/PMC3368730/ /pubmed/22435713 http://dx.doi.org/10.1186/1471-2105-13-43 Text en Copyright ©2012 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Zhang, Zhang
Li, Jun
Cui, Peng
Ding, Feng
Li, Ang
Townsend, Jeffrey P
Yu, Jun
Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance
title Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance
title_full Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance
title_fullStr Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance
title_full_unstemmed Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance
title_short Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance
title_sort codon deviation coefficient: a novel measure for estimating codon usage bias and its statistical significance
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3368730/
https://www.ncbi.nlm.nih.gov/pubmed/22435713
http://dx.doi.org/10.1186/1471-2105-13-43
work_keys_str_mv AT zhangzhang codondeviationcoefficientanovelmeasureforestimatingcodonusagebiasanditsstatisticalsignificance
AT lijun codondeviationcoefficientanovelmeasureforestimatingcodonusagebiasanditsstatisticalsignificance
AT cuipeng codondeviationcoefficientanovelmeasureforestimatingcodonusagebiasanditsstatisticalsignificance
AT dingfeng codondeviationcoefficientanovelmeasureforestimatingcodonusagebiasanditsstatisticalsignificance
AT liang codondeviationcoefficientanovelmeasureforestimatingcodonusagebiasanditsstatisticalsignificance
AT townsendjeffreyp codondeviationcoefficientanovelmeasureforestimatingcodonusagebiasanditsstatisticalsignificance
AT yujun codondeviationcoefficientanovelmeasureforestimatingcodonusagebiasanditsstatisticalsignificance