Cargando…

Dependency of codon usage on protein sequence patterns: a statistical study

BACKGROUND: Codon degeneracy and codon usage by organisms is an interesting and challenging problem. Researchers demonstrated the relation between codon usage and various functions or properties of genes and proteins, such as gene regulation, translation rate, translation efficiency, mRNA stability,...

Descripción completa

Detalles Bibliográficos
Autores principales: Foroughmand-Araabi, Mohammad-Hadi, Goliaei, Bahram, Alishahi, Kasra, Sadeghi, Mehdi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3896713/
https://www.ncbi.nlm.nih.gov/pubmed/24410898
http://dx.doi.org/10.1186/1742-4682-11-2
_version_ 1782300117483651072
author Foroughmand-Araabi, Mohammad-Hadi
Goliaei, Bahram
Alishahi, Kasra
Sadeghi, Mehdi
author_facet Foroughmand-Araabi, Mohammad-Hadi
Goliaei, Bahram
Alishahi, Kasra
Sadeghi, Mehdi
author_sort Foroughmand-Araabi, Mohammad-Hadi
collection PubMed
description BACKGROUND: Codon degeneracy and codon usage by organisms is an interesting and challenging problem. Researchers demonstrated the relation between codon usage and various functions or properties of genes and proteins, such as gene regulation, translation rate, translation efficiency, mRNA stability, splicing, and protein domains. Researchers usually represent segments of proteins responsible for specific functions or structures in a family of proteins as sequence patterns or motifs. We asked the question if organisms use the same codons in pattern segments as compared to the rest of the sequence. METHODS: We used the likelihood ratio test, Pearson’s chi-squared test, and mutual information to compare these two codon usages. RESULTS: We showed that codon usage, in segments of genes that code for a given pattern or motif in a group of proteins, varied from the rest of the gene. The codon usage in these segments was not random. Amino acids with larger number of codons used more specific codon ratios in these segments. We studied the number of amino acids in the pattern (pattern length). As patterns got longer, there was a slight decrease in the fraction of patterns with significant different codon usage in the pattern region as compared to codon usage in the gene region. We defined a measure of specificity of protein patterns, and studied its relation to the codon usage. The difference in the codon usage between pattern region and gene region, was less for the patterns with higher specificity. CONCLUSIONS: We provided a hypothesis that there are segments on genes that affect the codon usage and thus influence protein translation speed, and these regions are the regions that code protein pattern regions.
format Online
Article
Text
id pubmed-3896713
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38967132014-01-31 Dependency of codon usage on protein sequence patterns: a statistical study Foroughmand-Araabi, Mohammad-Hadi Goliaei, Bahram Alishahi, Kasra Sadeghi, Mehdi Theor Biol Med Model Research BACKGROUND: Codon degeneracy and codon usage by organisms is an interesting and challenging problem. Researchers demonstrated the relation between codon usage and various functions or properties of genes and proteins, such as gene regulation, translation rate, translation efficiency, mRNA stability, splicing, and protein domains. Researchers usually represent segments of proteins responsible for specific functions or structures in a family of proteins as sequence patterns or motifs. We asked the question if organisms use the same codons in pattern segments as compared to the rest of the sequence. METHODS: We used the likelihood ratio test, Pearson’s chi-squared test, and mutual information to compare these two codon usages. RESULTS: We showed that codon usage, in segments of genes that code for a given pattern or motif in a group of proteins, varied from the rest of the gene. The codon usage in these segments was not random. Amino acids with larger number of codons used more specific codon ratios in these segments. We studied the number of amino acids in the pattern (pattern length). As patterns got longer, there was a slight decrease in the fraction of patterns with significant different codon usage in the pattern region as compared to codon usage in the gene region. We defined a measure of specificity of protein patterns, and studied its relation to the codon usage. The difference in the codon usage between pattern region and gene region, was less for the patterns with higher specificity. CONCLUSIONS: We provided a hypothesis that there are segments on genes that affect the codon usage and thus influence protein translation speed, and these regions are the regions that code protein pattern regions. BioMed Central 2014-01-10 /pmc/articles/PMC3896713/ /pubmed/24410898 http://dx.doi.org/10.1186/1742-4682-11-2 Text en Copyright © 2014 Foroughmand-Araabi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Foroughmand-Araabi, Mohammad-Hadi
Goliaei, Bahram
Alishahi, Kasra
Sadeghi, Mehdi
Dependency of codon usage on protein sequence patterns: a statistical study
title Dependency of codon usage on protein sequence patterns: a statistical study
title_full Dependency of codon usage on protein sequence patterns: a statistical study
title_fullStr Dependency of codon usage on protein sequence patterns: a statistical study
title_full_unstemmed Dependency of codon usage on protein sequence patterns: a statistical study
title_short Dependency of codon usage on protein sequence patterns: a statistical study
title_sort dependency of codon usage on protein sequence patterns: a statistical study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3896713/
https://www.ncbi.nlm.nih.gov/pubmed/24410898
http://dx.doi.org/10.1186/1742-4682-11-2
work_keys_str_mv AT foroughmandaraabimohammadhadi dependencyofcodonusageonproteinsequencepatternsastatisticalstudy
AT goliaeibahram dependencyofcodonusageonproteinsequencepatternsastatisticalstudy
AT alishahikasra dependencyofcodonusageonproteinsequencepatternsastatisticalstudy
AT sadeghimehdi dependencyofcodonusageonproteinsequencepatternsastatisticalstudy