ESTuber db: an online database for Tuber borchii EST sequences

BACKGROUND: The ESTuber database includes 3,271 Tuber borchii expressed sequence tags (ESTs). The dataset consists of 2,389 sequences from an in-house prepared cDNA library from truffle vegetative hyphae, and 882 sequences downloaded from GenBank and representing four libraries from white truffle...

Bibliographic Details
Main authors: Lazzari, Barbara; Caprera, Andrea; Cosentino, Cristian; Stella, Alessandra; Milanesi, Luciano; Viotti, Angelo
Format: Text
Language: English
Published: BioMed Central 2007
Subjects: Research
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1885842/
https://www.ncbi.nlm.nih.gov/pubmed/17430557
http://dx.doi.org/10.1186/1471-2105-8-S1-S13
author Lazzari, Barbara
Caprera, Andrea
Cosentino, Cristian
Stella, Alessandra
Milanesi, Luciano
Viotti, Angelo
author_sort Lazzari, Barbara
collection PubMed
description BACKGROUND: The ESTuber database includes 3,271 Tuber borchii expressed sequence tags (ESTs). The dataset consists of 2,389 sequences from an in-house prepared cDNA library from truffle vegetative hyphae, and 882 sequences downloaded from GenBank and representing four libraries from white truffle mycelia and ascocarps at different developmental stages. An automated pipeline was prepared to process EST sequences using public software integrated with in-house developed Perl scripts. Data were collected in a MySQL database, which can be queried via a PHP-based web interface. RESULTS: Sequences included in the ESTuber db were clustered and annotated against three databases: the GenBank nr database, the UniProtKB database and a third, in-house prepared database of fungal genomic sequences. An algorithm was implemented to infer statistical classification among Gene Ontology categories from the ontology occurrences deduced from the annotation procedure against the UniProtKB database. Ontologies were also deduced from the annotation of more than 130,000 EST sequences from five filamentous fungi, for intra-species comparison purposes. Further analyses were performed on the ESTuber db dataset, including a tandem repeat search and a comparison of the putative protein dataset inferred from the EST sequences to the PROSITE database for protein pattern identification. All the analyses were performed both on the complete sequence dataset and on the contig consensus sequences generated by the EST assembly procedure. CONCLUSION: The resulting web site is a resource of data and links related to truffle expressed genes. The Sequence Report and Contig Report pages are the core structures of the web interface which, together with the Text search utility and the BLAST utility, allow easy access to the data stored in the database.
format Text
id pubmed-1885842
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1885842 2007-06-05 ESTuber db: an online database for Tuber borchii EST sequences Lazzari, Barbara; Caprera, Andrea; Cosentino, Cristian; Stella, Alessandra; Milanesi, Luciano; Viotti, Angelo BMC Bioinformatics Research BioMed Central 2007-03-08 /pmc/articles/PMC1885842/ /pubmed/17430557 http://dx.doi.org/10.1186/1471-2105-8-S1-S13 Text en Copyright © 2007 Lazzari et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title ESTuber db: an online database for Tuber borchii EST sequences
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1885842/
https://www.ncbi.nlm.nih.gov/pubmed/17430557
http://dx.doi.org/10.1186/1471-2105-8-S1-S13