Cargando…

An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome

Background: Although the cucumber reference genome and its annotation were published several years ago, the functional annotation of predicted genes, particularly protein-coding genes, still requires further improvement. In general, accurately determining orthologous relationships between genes allo...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Hongtao, Lin, Kui, Hu, Jinglu, Pang, Erli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5863696/
https://www.ncbi.nlm.nih.gov/pubmed/29599790
http://dx.doi.org/10.3389/fpls.2018.00325
_version_ 1783308426470752256
author Song, Hongtao
Lin, Kui
Hu, Jinglu
Pang, Erli
author_facet Song, Hongtao
Lin, Kui
Hu, Jinglu
Pang, Erli
author_sort Song, Hongtao
collection PubMed
description Background: Although the cucumber reference genome and its annotation were published several years ago, the functional annotation of predicted genes, particularly protein-coding genes, still requires further improvement. In general, accurately determining orthologous relationships between genes allows for better and more robust functional assignments of predicted genes. As one of the most reliable strategies, the determination of collinearity information may facilitate reliable orthology inferences among genes from multiple related genomes. Currently, the identification of collinear segments has mainly been based on conservation of gene order and orientation. Over the course of plant genome evolution, various evolutionary events have disrupted or distorted the order of genes along chromosomes, making it difficult to use those genes as genome-wide markers for plant genome comparisons. Results: Using the localized LASTZ/MULTIZ analysis pipeline, we aligned 15 genomes, including cucumber and other related angiosperm plants, and identified a set of genomic segments that are short in length, stable in structure, uniform in distribution and highly conserved across all 15 plants. Compared with protein-coding genes, these conserved segments were more suitable for use as genomic markers for detecting collinear segments among distantly divergent plants. Guided by this set of identified collinear genomic segments, we inferred 94,486 orthologous protein-coding gene pairs (OPPs) between cucumber and 14 other angiosperm species, which were used as proxies for transferring functional terms to cucumber genes from the annotations of the other 14 genomes. In total, 10,885 protein-coding genes were assigned Gene Ontology (GO) terms which was nearly 1,300 more than results collected in Uniprot-proteomic database. Our results showed that annotation accuracy would been improved compared with other existing approaches. Conclusions: In this study, we provided an alternative resource for the functional annotation of predicted cucumber protein-coding genes, which we expect will be beneficial for the cucumber's biological study, accessible from http://cmb.bnu.edu.cn/functional_annotation. Meanwhile, using the cucumber reference genome as a case study, we presented an efficient strategy for transferring gene functional information from previously well-characterized protein-coding genes in model species to newly sequenced or “non-model” plant species.
format Online
Article
Text
id pubmed-5863696
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-58636962018-03-29 An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome Song, Hongtao Lin, Kui Hu, Jinglu Pang, Erli Front Plant Sci Plant Science Background: Although the cucumber reference genome and its annotation were published several years ago, the functional annotation of predicted genes, particularly protein-coding genes, still requires further improvement. In general, accurately determining orthologous relationships between genes allows for better and more robust functional assignments of predicted genes. As one of the most reliable strategies, the determination of collinearity information may facilitate reliable orthology inferences among genes from multiple related genomes. Currently, the identification of collinear segments has mainly been based on conservation of gene order and orientation. Over the course of plant genome evolution, various evolutionary events have disrupted or distorted the order of genes along chromosomes, making it difficult to use those genes as genome-wide markers for plant genome comparisons. Results: Using the localized LASTZ/MULTIZ analysis pipeline, we aligned 15 genomes, including cucumber and other related angiosperm plants, and identified a set of genomic segments that are short in length, stable in structure, uniform in distribution and highly conserved across all 15 plants. Compared with protein-coding genes, these conserved segments were more suitable for use as genomic markers for detecting collinear segments among distantly divergent plants. Guided by this set of identified collinear genomic segments, we inferred 94,486 orthologous protein-coding gene pairs (OPPs) between cucumber and 14 other angiosperm species, which were used as proxies for transferring functional terms to cucumber genes from the annotations of the other 14 genomes. In total, 10,885 protein-coding genes were assigned Gene Ontology (GO) terms which was nearly 1,300 more than results collected in Uniprot-proteomic database. Our results showed that annotation accuracy would been improved compared with other existing approaches. Conclusions: In this study, we provided an alternative resource for the functional annotation of predicted cucumber protein-coding genes, which we expect will be beneficial for the cucumber's biological study, accessible from http://cmb.bnu.edu.cn/functional_annotation. Meanwhile, using the cucumber reference genome as a case study, we presented an efficient strategy for transferring gene functional information from previously well-characterized protein-coding genes in model species to newly sequenced or “non-model” plant species. Frontiers Media S.A. 2018-03-15 /pmc/articles/PMC5863696/ /pubmed/29599790 http://dx.doi.org/10.3389/fpls.2018.00325 Text en Copyright © 2018 Song, Lin, Hu and Pang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Song, Hongtao
Lin, Kui
Hu, Jinglu
Pang, Erli
An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome
title An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome
title_full An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome
title_fullStr An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome
title_full_unstemmed An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome
title_short An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome
title_sort updated functional annotation of protein-coding genes in the cucumber genome
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5863696/
https://www.ncbi.nlm.nih.gov/pubmed/29599790
http://dx.doi.org/10.3389/fpls.2018.00325
work_keys_str_mv AT songhongtao anupdatedfunctionalannotationofproteincodinggenesinthecucumbergenome
AT linkui anupdatedfunctionalannotationofproteincodinggenesinthecucumbergenome
AT hujinglu anupdatedfunctionalannotationofproteincodinggenesinthecucumbergenome
AT pangerli anupdatedfunctionalannotationofproteincodinggenesinthecucumbergenome
AT songhongtao updatedfunctionalannotationofproteincodinggenesinthecucumbergenome
AT linkui updatedfunctionalannotationofproteincodinggenesinthecucumbergenome
AT hujinglu updatedfunctionalannotationofproteincodinggenesinthecucumbergenome
AT pangerli updatedfunctionalannotationofproteincodinggenesinthecucumbergenome