Cargando…

Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants

BACKGROUND: While the gargantuan multi-nation effort of sequencing T. aestivum gets close to completion, the annotation process for the vast number of wheat genes and proteins is in its infancy. Previous experimental studies carried out on model plant organisms such as A. thaliana and O. sativa prov...

Descripción completa

Detalles Bibliográficos
Autores principales: Tulpan, Dan, Leger, Serge, Tchagang, Alain, Pan, Youlian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426649/
https://www.ncbi.nlm.nih.gov/pubmed/25887590
http://dx.doi.org/10.1186/s12864-015-1496-2
_version_ 1782370611443531776
author Tulpan, Dan
Leger, Serge
Tchagang, Alain
Pan, Youlian
author_facet Tulpan, Dan
Leger, Serge
Tchagang, Alain
Pan, Youlian
author_sort Tulpan, Dan
collection PubMed
description BACKGROUND: While the gargantuan multi-nation effort of sequencing T. aestivum gets close to completion, the annotation process for the vast number of wheat genes and proteins is in its infancy. Previous experimental studies carried out on model plant organisms such as A. thaliana and O. sativa provide a plethora of gene annotations that can be used as potential starting points for wheat gene annotations, proven that solid cross-species gene-to-gene and protein-to-protein correspondences are provided. RESULTS: DNA and protein sequences and corresponding annotations for T. aestivum and 9 other plant species were collected from Ensembl Plants release 22 and curated. Cliques of predicted 1-to-1 orthologs were identified and an annotation enrichment model was defined based on existing gene-GO term associations and phylogenetic relationships among wheat and 9 other plant species. A total of 13 cliques of size 10 were identified, which represent putative functionally equivalent genes and proteins in the 10 plant species. Eighty-five new and more specific GO terms were associated with wheat genes in the 13 cliques of size 10, which represent a 65% increase compared with the previously 130 known GO terms. Similar expression patterns for 4 genes from Arabidopsis, barley, maize and rice in cliques of size 10 provide experimental evidence to support our model. Overall, based on clique size equal or larger than 3, our model enriched the existing gene-GO term associations for 7,838 (8%) wheat genes, of which 2,139 had no previous annotation. CONCLUSIONS: Our novel comparative genomics approach enriches existing T. aestivum gene annotations based on cliques of predicted 1-to-1 orthologs, phylogenetic relationships and existing gene ontologies from 9 other plant species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1496-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4426649
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44266492015-05-12 Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants Tulpan, Dan Leger, Serge Tchagang, Alain Pan, Youlian BMC Genomics Research Article BACKGROUND: While the gargantuan multi-nation effort of sequencing T. aestivum gets close to completion, the annotation process for the vast number of wheat genes and proteins is in its infancy. Previous experimental studies carried out on model plant organisms such as A. thaliana and O. sativa provide a plethora of gene annotations that can be used as potential starting points for wheat gene annotations, proven that solid cross-species gene-to-gene and protein-to-protein correspondences are provided. RESULTS: DNA and protein sequences and corresponding annotations for T. aestivum and 9 other plant species were collected from Ensembl Plants release 22 and curated. Cliques of predicted 1-to-1 orthologs were identified and an annotation enrichment model was defined based on existing gene-GO term associations and phylogenetic relationships among wheat and 9 other plant species. A total of 13 cliques of size 10 were identified, which represent putative functionally equivalent genes and proteins in the 10 plant species. Eighty-five new and more specific GO terms were associated with wheat genes in the 13 cliques of size 10, which represent a 65% increase compared with the previously 130 known GO terms. Similar expression patterns for 4 genes from Arabidopsis, barley, maize and rice in cliques of size 10 provide experimental evidence to support our model. Overall, based on clique size equal or larger than 3, our model enriched the existing gene-GO term associations for 7,838 (8%) wheat genes, of which 2,139 had no previous annotation. CONCLUSIONS: Our novel comparative genomics approach enriches existing T. aestivum gene annotations based on cliques of predicted 1-to-1 orthologs, phylogenetic relationships and existing gene ontologies from 9 other plant species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1496-2) contains supplementary material, which is available to authorized users. BioMed Central 2015-04-15 /pmc/articles/PMC4426649/ /pubmed/25887590 http://dx.doi.org/10.1186/s12864-015-1496-2 Text en © Tulpan et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Tulpan, Dan
Leger, Serge
Tchagang, Alain
Pan, Youlian
Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants
title Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants
title_full Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants
title_fullStr Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants
title_full_unstemmed Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants
title_short Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants
title_sort enrichment of triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426649/
https://www.ncbi.nlm.nih.gov/pubmed/25887590
http://dx.doi.org/10.1186/s12864-015-1496-2
work_keys_str_mv AT tulpandan enrichmentoftriticumaestivumgeneannotationsusingorthologcliquesandgeneontologiesinotherplants
AT legerserge enrichmentoftriticumaestivumgeneannotationsusingorthologcliquesandgeneontologiesinotherplants
AT tchagangalain enrichmentoftriticumaestivumgeneannotationsusingorthologcliquesandgeneontologiesinotherplants
AT panyoulian enrichmentoftriticumaestivumgeneannotationsusingorthologcliquesandgeneontologiesinotherplants