Cargando…

dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts

The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The...

Descripción completa

Detalles Bibliográficos
Autores principales: Vincent, Jonathan, Dai, Zhanwu, Ravel, Catherine, Choulet, Frédéric, Mouzeyar, Said, Bouzidi, M. Fouad, Agier, Marie, Martre, Pierre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3649639/
https://www.ncbi.nlm.nih.gov/pubmed/23660284
http://dx.doi.org/10.1093/database/bat014
_version_ 1782269007341027328
author Vincent, Jonathan
Dai, Zhanwu
Ravel, Catherine
Choulet, Frédéric
Mouzeyar, Said
Bouzidi, M. Fouad
Agier, Marie
Martre, Pierre
author_facet Vincent, Jonathan
Dai, Zhanwu
Ravel, Catherine
Choulet, Frédéric
Mouzeyar, Said
Bouzidi, M. Fouad
Agier, Marie
Martre, Pierre
author_sort Vincent, Jonathan
collection PubMed
description The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The database, named dbWFA, is based on the reference NCBI UniGene set, an expressed gene catalogue built by expressed sequence tag clustering, and on full-length coding sequences retrieved from the TriFLDB database. Information from good-quality heterogeneous sources, including annotations for model plant species Arabidopsis thaliana (L.) Heynh. and Oryza sativa L., was gathered and linked to T. aestivum sequences through BLAST-based homology searches. Even though the complexity of the transcriptome cannot yet be fully appreciated, we developed a tool to easily and promptly obtain information from multiple functional annotation systems (Gene Ontology, MapMan bin codes, MIPS Functional Categories, PlantCyc pathway reactions and TAIR gene families). The use of dbWFA is illustrated here with several query examples. We were able to assign a putative function to 45% of the UniGenes and 81% of the full-length coding sequences from TriFLDB. Moreover, comparison of the annotation of the whole T. aestivum UniGene set along with curated annotations of the two model species assessed the accuracy of the annotation provided by dbWFA. To further illustrate the use of dbWFA, genes specifically expressed during the early cell division or late storage polymer accumulation phases of T. aestivum grain development were identified using a clustering analysis and then annotated using dbWFA. The annotation of these two sets of genes was consistent with previous analyses of T. aestivum grain transcriptomes and proteomes. Database URL: urgi.versailles.inra.fr/dbWFA/
format Online
Article
Text
id pubmed-3649639
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36496392013-05-13 dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts Vincent, Jonathan Dai, Zhanwu Ravel, Catherine Choulet, Frédéric Mouzeyar, Said Bouzidi, M. Fouad Agier, Marie Martre, Pierre Database (Oxford) Database Tool The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The database, named dbWFA, is based on the reference NCBI UniGene set, an expressed gene catalogue built by expressed sequence tag clustering, and on full-length coding sequences retrieved from the TriFLDB database. Information from good-quality heterogeneous sources, including annotations for model plant species Arabidopsis thaliana (L.) Heynh. and Oryza sativa L., was gathered and linked to T. aestivum sequences through BLAST-based homology searches. Even though the complexity of the transcriptome cannot yet be fully appreciated, we developed a tool to easily and promptly obtain information from multiple functional annotation systems (Gene Ontology, MapMan bin codes, MIPS Functional Categories, PlantCyc pathway reactions and TAIR gene families). The use of dbWFA is illustrated here with several query examples. We were able to assign a putative function to 45% of the UniGenes and 81% of the full-length coding sequences from TriFLDB. Moreover, comparison of the annotation of the whole T. aestivum UniGene set along with curated annotations of the two model species assessed the accuracy of the annotation provided by dbWFA. To further illustrate the use of dbWFA, genes specifically expressed during the early cell division or late storage polymer accumulation phases of T. aestivum grain development were identified using a clustering analysis and then annotated using dbWFA. The annotation of these two sets of genes was consistent with previous analyses of T. aestivum grain transcriptomes and proteomes. Database URL: urgi.versailles.inra.fr/dbWFA/ Oxford University Press 2013-05-09 /pmc/articles/PMC3649639/ /pubmed/23660284 http://dx.doi.org/10.1093/database/bat014 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Vincent, Jonathan
Dai, Zhanwu
Ravel, Catherine
Choulet, Frédéric
Mouzeyar, Said
Bouzidi, M. Fouad
Agier, Marie
Martre, Pierre
dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts
title dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts
title_full dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts
title_fullStr dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts
title_full_unstemmed dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts
title_short dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts
title_sort dbwfa: a web-based database for functional annotation of triticum aestivum transcripts
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3649639/
https://www.ncbi.nlm.nih.gov/pubmed/23660284
http://dx.doi.org/10.1093/database/bat014
work_keys_str_mv AT vincentjonathan dbwfaawebbaseddatabaseforfunctionalannotationoftriticumaestivumtranscripts
AT daizhanwu dbwfaawebbaseddatabaseforfunctionalannotationoftriticumaestivumtranscripts
AT ravelcatherine dbwfaawebbaseddatabaseforfunctionalannotationoftriticumaestivumtranscripts
AT chouletfrederic dbwfaawebbaseddatabaseforfunctionalannotationoftriticumaestivumtranscripts
AT mouzeyarsaid dbwfaawebbaseddatabaseforfunctionalannotationoftriticumaestivumtranscripts
AT bouzidimfouad dbwfaawebbaseddatabaseforfunctionalannotationoftriticumaestivumtranscripts
AT agiermarie dbwfaawebbaseddatabaseforfunctionalannotationoftriticumaestivumtranscripts
AT martrepierre dbwfaawebbaseddatabaseforfunctionalannotationoftriticumaestivumtranscripts