Cargando…
Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps
BACKGROUND: Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on t...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5397707/ https://www.ncbi.nlm.nih.gov/pubmed/28427327 http://dx.doi.org/10.1186/s12864-017-3704-8 |
_version_ | 1783230317742522368 |
---|---|
author | Huang, Xing Xu, Jing Chen, Lin Wang, Yu Gu, Xiaobin Peng, Xuerong Yang, Guangyou |
author_facet | Huang, Xing Xu, Jing Chen, Lin Wang, Yu Gu, Xiaobin Peng, Xuerong Yang, Guangyou |
author_sort | Huang, Xing |
collection | PubMed |
description | BACKGROUND: Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. RESULTS: Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as “optimal codons”. Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. CONCLUSIONS: In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3704-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5397707 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53977072017-04-20 Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps Huang, Xing Xu, Jing Chen, Lin Wang, Yu Gu, Xiaobin Peng, Xuerong Yang, Guangyou BMC Genomics Research Article BACKGROUND: Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. RESULTS: Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as “optimal codons”. Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. CONCLUSIONS: In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3704-8) contains supplementary material, which is available to authorized users. BioMed Central 2017-04-20 /pmc/articles/PMC5397707/ /pubmed/28427327 http://dx.doi.org/10.1186/s12864-017-3704-8 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Huang, Xing Xu, Jing Chen, Lin Wang, Yu Gu, Xiaobin Peng, Xuerong Yang, Guangyou Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps |
title | Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps |
title_full | Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps |
title_fullStr | Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps |
title_full_unstemmed | Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps |
title_short | Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps |
title_sort | analysis of transcriptome data reveals multifactor constraint on codon usage in taenia multiceps |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5397707/ https://www.ncbi.nlm.nih.gov/pubmed/28427327 http://dx.doi.org/10.1186/s12864-017-3704-8 |
work_keys_str_mv | AT huangxing analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps AT xujing analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps AT chenlin analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps AT wangyu analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps AT guxiaobin analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps AT pengxuerong analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps AT yangguangyou analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps |