Cargando…

Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps

BACKGROUND: Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on t...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Xing, Xu, Jing, Chen, Lin, Wang, Yu, Gu, Xiaobin, Peng, Xuerong, Yang, Guangyou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5397707/
https://www.ncbi.nlm.nih.gov/pubmed/28427327
http://dx.doi.org/10.1186/s12864-017-3704-8
_version_ 1783230317742522368
author Huang, Xing
Xu, Jing
Chen, Lin
Wang, Yu
Gu, Xiaobin
Peng, Xuerong
Yang, Guangyou
author_facet Huang, Xing
Xu, Jing
Chen, Lin
Wang, Yu
Gu, Xiaobin
Peng, Xuerong
Yang, Guangyou
author_sort Huang, Xing
collection PubMed
description BACKGROUND: Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. RESULTS: Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as “optimal codons”. Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. CONCLUSIONS: In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3704-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5397707
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53977072017-04-20 Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps Huang, Xing Xu, Jing Chen, Lin Wang, Yu Gu, Xiaobin Peng, Xuerong Yang, Guangyou BMC Genomics Research Article BACKGROUND: Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. RESULTS: Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as “optimal codons”. Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. CONCLUSIONS: In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3704-8) contains supplementary material, which is available to authorized users. BioMed Central 2017-04-20 /pmc/articles/PMC5397707/ /pubmed/28427327 http://dx.doi.org/10.1186/s12864-017-3704-8 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Huang, Xing
Xu, Jing
Chen, Lin
Wang, Yu
Gu, Xiaobin
Peng, Xuerong
Yang, Guangyou
Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps
title Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps
title_full Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps
title_fullStr Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps
title_full_unstemmed Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps
title_short Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps
title_sort analysis of transcriptome data reveals multifactor constraint on codon usage in taenia multiceps
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5397707/
https://www.ncbi.nlm.nih.gov/pubmed/28427327
http://dx.doi.org/10.1186/s12864-017-3704-8
work_keys_str_mv AT huangxing analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps
AT xujing analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps
AT chenlin analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps
AT wangyu analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps
AT guxiaobin analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps
AT pengxuerong analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps
AT yangguangyou analysisoftranscriptomedatarevealsmultifactorconstraintoncodonusageintaeniamulticeps