Cargando…

Version VI of the ESTree db: an improved tool for peach transcriptome analysis

BACKGROUND: The ESTree database (db) is a collection of Prunus persica and Prunus dulcis EST sequences that in its current version encompasses 75,404 sequences from 3 almond and 19 peach libraries. Nine peach genotypes and four peach tissues are represented, from four fruit developmental stages. The...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lazzari, Barbara, Caprera, Andrea, Vecchietti, Alberto, Merelli, Ivan, Barale, Francesca, Milanesi, Luciano, Stella, Alessandra, Pozzi, Carlo
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2008
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2323672/ https://www.ncbi.nlm.nih.gov/pubmed/18387211 http://dx.doi.org/10.1186/1471-2105-9-S2-S9

_version_	1782152676311564288
author	Lazzari, Barbara Caprera, Andrea Vecchietti, Alberto Merelli, Ivan Barale, Francesca Milanesi, Luciano Stella, Alessandra Pozzi, Carlo
author_facet	Lazzari, Barbara Caprera, Andrea Vecchietti, Alberto Merelli, Ivan Barale, Francesca Milanesi, Luciano Stella, Alessandra Pozzi, Carlo
author_sort	Lazzari, Barbara
collection	PubMed
description	BACKGROUND: The ESTree database (db) is a collection of Prunus persica and Prunus dulcis EST sequences that in its current version encompasses 75,404 sequences from 3 almond and 19 peach libraries. Nine peach genotypes and four peach tissues are represented, from four fruit developmental stages. The aim of this work was to implement the already existing ESTree db by adding new sequences and analysis programs. Particular care was given to the implementation of the web interface, that allows querying each of the database features. RESULTS: A Perl modular pipeline is the backbone of sequence analysis in the ESTree db project. Outputs obtained during the pipeline steps are automatically arrayed into the fields of a MySQL database. Apart from standard clustering and annotation analyses, version VI of the ESTree db encompasses new tools for tandem repeat identification, annotation against genomic Rosaceae sequences, and positioning on the database of oligomer sequences that were used in a peach microarray study. Furthermore, known protein patterns and motifs were identified by comparison to PROSITE. Based on data retrieved from sequence annotation against the UniProtKB database, a script was prepared to track positions of homologous hits on the GO tree and build statistics on the ontologies distribution in GO functional categories. EST mapping data were also integrated in the database. The PHP-based web interface was upgraded and extended. The aim of the authors was to enable querying the database according to all the biological aspects that can be investigated from the analysis of data available in the ESTree db. This is achieved by allowing multiple searches on logical subsets of sequences that represent different biological situations or features. CONCLUSIONS: The version VI of ESTree db offers a broad overview on peach gene expression. Sequence analyses results contained in the database, extensively linked to external related resources, represent a large amount of information that can be queried via the tools offered in the web interface. Flexibility and modularity of the ESTree analysis pipeline and of the web interface allowed the authors to set up similar structures for different datasets, with limited manual intervention.
format	Text
id	pubmed-2323672
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-23236722008-04-22 Version VI of the ESTree db: an improved tool for peach transcriptome analysis Lazzari, Barbara Caprera, Andrea Vecchietti, Alberto Merelli, Ivan Barale, Francesca Milanesi, Luciano Stella, Alessandra Pozzi, Carlo BMC Bioinformatics Research BACKGROUND: The ESTree database (db) is a collection of Prunus persica and Prunus dulcis EST sequences that in its current version encompasses 75,404 sequences from 3 almond and 19 peach libraries. Nine peach genotypes and four peach tissues are represented, from four fruit developmental stages. The aim of this work was to implement the already existing ESTree db by adding new sequences and analysis programs. Particular care was given to the implementation of the web interface, that allows querying each of the database features. RESULTS: A Perl modular pipeline is the backbone of sequence analysis in the ESTree db project. Outputs obtained during the pipeline steps are automatically arrayed into the fields of a MySQL database. Apart from standard clustering and annotation analyses, version VI of the ESTree db encompasses new tools for tandem repeat identification, annotation against genomic Rosaceae sequences, and positioning on the database of oligomer sequences that were used in a peach microarray study. Furthermore, known protein patterns and motifs were identified by comparison to PROSITE. Based on data retrieved from sequence annotation against the UniProtKB database, a script was prepared to track positions of homologous hits on the GO tree and build statistics on the ontologies distribution in GO functional categories. EST mapping data were also integrated in the database. The PHP-based web interface was upgraded and extended. The aim of the authors was to enable querying the database according to all the biological aspects that can be investigated from the analysis of data available in the ESTree db. This is achieved by allowing multiple searches on logical subsets of sequences that represent different biological situations or features. CONCLUSIONS: The version VI of ESTree db offers a broad overview on peach gene expression. Sequence analyses results contained in the database, extensively linked to external related resources, represent a large amount of information that can be queried via the tools offered in the web interface. Flexibility and modularity of the ESTree analysis pipeline and of the web interface allowed the authors to set up similar structures for different datasets, with limited manual intervention. BioMed Central 2008-03-26 /pmc/articles/PMC2323672/ /pubmed/18387211 http://dx.doi.org/10.1186/1471-2105-9-S2-S9 Text en Copyright © 2008 Lazzari et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Lazzari, Barbara Caprera, Andrea Vecchietti, Alberto Merelli, Ivan Barale, Francesca Milanesi, Luciano Stella, Alessandra Pozzi, Carlo Version VI of the ESTree db: an improved tool for peach transcriptome analysis
title	Version VI of the ESTree db: an improved tool for peach transcriptome analysis
title_full	Version VI of the ESTree db: an improved tool for peach transcriptome analysis
title_fullStr	Version VI of the ESTree db: an improved tool for peach transcriptome analysis
title_full_unstemmed	Version VI of the ESTree db: an improved tool for peach transcriptome analysis
title_short	Version VI of the ESTree db: an improved tool for peach transcriptome analysis
title_sort	version vi of the estree db: an improved tool for peach transcriptome analysis
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2323672/ https://www.ncbi.nlm.nih.gov/pubmed/18387211 http://dx.doi.org/10.1186/1471-2105-9-S2-S9
work_keys_str_mv	AT lazzaribarbara versionvioftheestreedbanimprovedtoolforpeachtranscriptomeanalysis AT capreraandrea versionvioftheestreedbanimprovedtoolforpeachtranscriptomeanalysis AT vecchiettialberto versionvioftheestreedbanimprovedtoolforpeachtranscriptomeanalysis AT merelliivan versionvioftheestreedbanimprovedtoolforpeachtranscriptomeanalysis AT baralefrancesca versionvioftheestreedbanimprovedtoolforpeachtranscriptomeanalysis AT milanesiluciano versionvioftheestreedbanimprovedtoolforpeachtranscriptomeanalysis AT stellaalessandra versionvioftheestreedbanimprovedtoolforpeachtranscriptomeanalysis AT pozzicarlo versionvioftheestreedbanimprovedtoolforpeachtranscriptomeanalysis

Version VI of the ESTree db: an improved tool for peach transcriptome analysis

Ejemplares similares