Cargando…

The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections

BACKGROUND: The NCBI dbEST currently contains more than eight million human Expressed Sequenced Tags (ESTs). This wide collection represents an important source of information for gene expression studies, provided it can be inspected according to biologically relevant criteria. EST data can be brows...

Descripción completa

Detalles Bibliográficos
Autores principales: Merelli, Ivan, Caprera, Andrea, Stella, Alessandra, Del Corvo, Marcello, Milanesi, Luciano, Lazzari, Barbara
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2762067/
https://www.ncbi.nlm.nih.gov/pubmed/19828078
http://dx.doi.org/10.1186/1471-2105-10-S12-S2
_version_ 1782172890157809664
author Merelli, Ivan
Caprera, Andrea
Stella, Alessandra
Del Corvo, Marcello
Milanesi, Luciano
Lazzari, Barbara
author_facet Merelli, Ivan
Caprera, Andrea
Stella, Alessandra
Del Corvo, Marcello
Milanesi, Luciano
Lazzari, Barbara
author_sort Merelli, Ivan
collection PubMed
description BACKGROUND: The NCBI dbEST currently contains more than eight million human Expressed Sequenced Tags (ESTs). This wide collection represents an important source of information for gene expression studies, provided it can be inspected according to biologically relevant criteria. EST data can be browsed using different dedicated web resources, which allow to investigate library specific gene expression levels and to make comparisons among libraries, highlighting significant differences in gene expression. Nonetheless, no tool is available to examine distributions of quantitative EST collections in Gene Ontology (GO) categories, nor to retrieve information concerning library-dependent EST involvement in metabolic pathways. In this work we present the Human EST Ontology Explorer (HEOE) , a web facility for comparison of expression levels among libraries from several healthy and diseased tissues. RESULTS: The HEOE provides library-dependent statistics on the distribution of sequences in the GO Direct Acyclic Graph (DAG) that can be browsed at each GO hierarchical level. The tool is based on large-scale BLAST annotation of EST sequences. Due to the huge number of input sequences, this BLAST analysis was performed with the aid of grid computing technology, which is particularly suitable to address data parallel task. Relying on the achieved annotation, library-specific distributions of ESTs in the GO Graph were inferred. A pathway-based search interface was also implemented, for a quick evaluation of the representation of libraries in metabolic pathways. EST processing steps were integrated in a semi-automatic procedure that relies on Perl scripts and stores results in a MySQL database. A PHP-based web interface offers the possibility to simultaneously visualize, retrieve and compare data from the different libraries. Statistically significant differences in GO categories among user selected libraries can also be computed. CONCLUSION: The HEOE provides an alternative and complementary way to inspect EST expression levels with respect to approaches currently offered by other resources. Furthermore, BLAST computation on the whole human EST dataset was a suitable test of grid scalability in the context of large-scale bioinformatics analysis. The HEOE currently comprises sequence analysis from 70 non-normalized libraries, representing a comprehensive overview on healthy and unhealthy tissues. As the analysis procedure can be easily applied to other libraries, the number of represented tissues is intended to increase.
format Text
id pubmed-2762067
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27620672009-10-15 The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections Merelli, Ivan Caprera, Andrea Stella, Alessandra Del Corvo, Marcello Milanesi, Luciano Lazzari, Barbara BMC Bioinformatics Research BACKGROUND: The NCBI dbEST currently contains more than eight million human Expressed Sequenced Tags (ESTs). This wide collection represents an important source of information for gene expression studies, provided it can be inspected according to biologically relevant criteria. EST data can be browsed using different dedicated web resources, which allow to investigate library specific gene expression levels and to make comparisons among libraries, highlighting significant differences in gene expression. Nonetheless, no tool is available to examine distributions of quantitative EST collections in Gene Ontology (GO) categories, nor to retrieve information concerning library-dependent EST involvement in metabolic pathways. In this work we present the Human EST Ontology Explorer (HEOE) , a web facility for comparison of expression levels among libraries from several healthy and diseased tissues. RESULTS: The HEOE provides library-dependent statistics on the distribution of sequences in the GO Direct Acyclic Graph (DAG) that can be browsed at each GO hierarchical level. The tool is based on large-scale BLAST annotation of EST sequences. Due to the huge number of input sequences, this BLAST analysis was performed with the aid of grid computing technology, which is particularly suitable to address data parallel task. Relying on the achieved annotation, library-specific distributions of ESTs in the GO Graph were inferred. A pathway-based search interface was also implemented, for a quick evaluation of the representation of libraries in metabolic pathways. EST processing steps were integrated in a semi-automatic procedure that relies on Perl scripts and stores results in a MySQL database. A PHP-based web interface offers the possibility to simultaneously visualize, retrieve and compare data from the different libraries. Statistically significant differences in GO categories among user selected libraries can also be computed. CONCLUSION: The HEOE provides an alternative and complementary way to inspect EST expression levels with respect to approaches currently offered by other resources. Furthermore, BLAST computation on the whole human EST dataset was a suitable test of grid scalability in the context of large-scale bioinformatics analysis. The HEOE currently comprises sequence analysis from 70 non-normalized libraries, representing a comprehensive overview on healthy and unhealthy tissues. As the analysis procedure can be easily applied to other libraries, the number of represented tissues is intended to increase. BioMed Central 2009-10-15 /pmc/articles/PMC2762067/ /pubmed/19828078 http://dx.doi.org/10.1186/1471-2105-10-S12-S2 Text en Copyright © 2009 Merelli et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Merelli, Ivan
Caprera, Andrea
Stella, Alessandra
Del Corvo, Marcello
Milanesi, Luciano
Lazzari, Barbara
The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections
title The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections
title_full The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections
title_fullStr The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections
title_full_unstemmed The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections
title_short The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections
title_sort human est ontology explorer: a tissue-oriented visualization system for ontologies distribution in human est collections
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2762067/
https://www.ncbi.nlm.nih.gov/pubmed/19828078
http://dx.doi.org/10.1186/1471-2105-10-S12-S2
work_keys_str_mv AT merelliivan thehumanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT capreraandrea thehumanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT stellaalessandra thehumanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT delcorvomarcello thehumanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT milanesiluciano thehumanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT lazzaribarbara thehumanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT merelliivan humanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT capreraandrea humanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT stellaalessandra humanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT delcorvomarcello humanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT milanesiluciano humanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections
AT lazzaribarbara humanestontologyexploreratissueorientedvisualizationsystemforontologiesdistributioninhumanestcollections