Cargando…
Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants
BACKGROUND: While the gargantuan multi-nation effort of sequencing T. aestivum gets close to completion, the annotation process for the vast number of wheat genes and proteins is in its infancy. Previous experimental studies carried out on model plant organisms such as A. thaliana and O. sativa prov...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426649/ https://www.ncbi.nlm.nih.gov/pubmed/25887590 http://dx.doi.org/10.1186/s12864-015-1496-2 |
_version_ | 1782370611443531776 |
---|---|
author | Tulpan, Dan Leger, Serge Tchagang, Alain Pan, Youlian |
author_facet | Tulpan, Dan Leger, Serge Tchagang, Alain Pan, Youlian |
author_sort | Tulpan, Dan |
collection | PubMed |
description | BACKGROUND: While the gargantuan multi-nation effort of sequencing T. aestivum gets close to completion, the annotation process for the vast number of wheat genes and proteins is in its infancy. Previous experimental studies carried out on model plant organisms such as A. thaliana and O. sativa provide a plethora of gene annotations that can be used as potential starting points for wheat gene annotations, proven that solid cross-species gene-to-gene and protein-to-protein correspondences are provided. RESULTS: DNA and protein sequences and corresponding annotations for T. aestivum and 9 other plant species were collected from Ensembl Plants release 22 and curated. Cliques of predicted 1-to-1 orthologs were identified and an annotation enrichment model was defined based on existing gene-GO term associations and phylogenetic relationships among wheat and 9 other plant species. A total of 13 cliques of size 10 were identified, which represent putative functionally equivalent genes and proteins in the 10 plant species. Eighty-five new and more specific GO terms were associated with wheat genes in the 13 cliques of size 10, which represent a 65% increase compared with the previously 130 known GO terms. Similar expression patterns for 4 genes from Arabidopsis, barley, maize and rice in cliques of size 10 provide experimental evidence to support our model. Overall, based on clique size equal or larger than 3, our model enriched the existing gene-GO term associations for 7,838 (8%) wheat genes, of which 2,139 had no previous annotation. CONCLUSIONS: Our novel comparative genomics approach enriches existing T. aestivum gene annotations based on cliques of predicted 1-to-1 orthologs, phylogenetic relationships and existing gene ontologies from 9 other plant species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1496-2) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4426649 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44266492015-05-12 Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants Tulpan, Dan Leger, Serge Tchagang, Alain Pan, Youlian BMC Genomics Research Article BACKGROUND: While the gargantuan multi-nation effort of sequencing T. aestivum gets close to completion, the annotation process for the vast number of wheat genes and proteins is in its infancy. Previous experimental studies carried out on model plant organisms such as A. thaliana and O. sativa provide a plethora of gene annotations that can be used as potential starting points for wheat gene annotations, proven that solid cross-species gene-to-gene and protein-to-protein correspondences are provided. RESULTS: DNA and protein sequences and corresponding annotations for T. aestivum and 9 other plant species were collected from Ensembl Plants release 22 and curated. Cliques of predicted 1-to-1 orthologs were identified and an annotation enrichment model was defined based on existing gene-GO term associations and phylogenetic relationships among wheat and 9 other plant species. A total of 13 cliques of size 10 were identified, which represent putative functionally equivalent genes and proteins in the 10 plant species. Eighty-five new and more specific GO terms were associated with wheat genes in the 13 cliques of size 10, which represent a 65% increase compared with the previously 130 known GO terms. Similar expression patterns for 4 genes from Arabidopsis, barley, maize and rice in cliques of size 10 provide experimental evidence to support our model. Overall, based on clique size equal or larger than 3, our model enriched the existing gene-GO term associations for 7,838 (8%) wheat genes, of which 2,139 had no previous annotation. CONCLUSIONS: Our novel comparative genomics approach enriches existing T. aestivum gene annotations based on cliques of predicted 1-to-1 orthologs, phylogenetic relationships and existing gene ontologies from 9 other plant species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1496-2) contains supplementary material, which is available to authorized users. BioMed Central 2015-04-15 /pmc/articles/PMC4426649/ /pubmed/25887590 http://dx.doi.org/10.1186/s12864-015-1496-2 Text en © Tulpan et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Tulpan, Dan Leger, Serge Tchagang, Alain Pan, Youlian Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants |
title | Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants |
title_full | Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants |
title_fullStr | Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants |
title_full_unstemmed | Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants |
title_short | Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants |
title_sort | enrichment of triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426649/ https://www.ncbi.nlm.nih.gov/pubmed/25887590 http://dx.doi.org/10.1186/s12864-015-1496-2 |
work_keys_str_mv | AT tulpandan enrichmentoftriticumaestivumgeneannotationsusingorthologcliquesandgeneontologiesinotherplants AT legerserge enrichmentoftriticumaestivumgeneannotationsusingorthologcliquesandgeneontologiesinotherplants AT tchagangalain enrichmentoftriticumaestivumgeneannotationsusingorthologcliquesandgeneontologiesinotherplants AT panyoulian enrichmentoftriticumaestivumgeneannotationsusingorthologcliquesandgeneontologiesinotherplants |