Cargando…

Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data

The widespread availability of microarray technology has driven functional genomics to the forefront as scientists seek to draw meaningful biological conclusions from their microarray results. Gene annotation enrichment analysis is a functional analysis technique that has gained widespread attention...

Descripción completa

Detalles Bibliográficos
Autores principales: Berg, Bart HJ van den, Thanthiriwatte, Chamali, Manda, Prashanti, Bridges, Susan M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226198/
https://www.ncbi.nlm.nih.gov/pubmed/19811693
http://dx.doi.org/10.1186/1471-2105-10-S11-S9
_version_ 1782217580753190912
author Berg, Bart HJ van den
Thanthiriwatte, Chamali
Manda, Prashanti
Bridges, Susan M
author_facet Berg, Bart HJ van den
Thanthiriwatte, Chamali
Manda, Prashanti
Bridges, Susan M
author_sort Berg, Bart HJ van den
collection PubMed
description The widespread availability of microarray technology has driven functional genomics to the forefront as scientists seek to draw meaningful biological conclusions from their microarray results. Gene annotation enrichment analysis is a functional analysis technique that has gained widespread attention and for which many tools have been developed. Unfortunately, most of these tools have limited support for agricultural species. Here, we evaluate and compare four publicly available computational tools (Onto-Express, EasyGO, GOstat, and DAVID) that support analysis of gene expression datasets in agricultural species. We use AgBase as the functional annotation reference for agricultural species. The selected tools were evaluated based on i) available features, usage and accessibility, ii) implemented statistical computational methods, and iii) annotation and enrichment performance analysis. Annotation was assessed using a randomly selected test gene annotation set and an experimental differentially expressed gene-set – both from chicken. The experimental set was also used to evaluate identification of enriched functional groups. Comparison of the tools shows that they produce different sets of annotations for the two datasets and different functional groups for the experimental dataset. While DAVID, GOstat and Onto-Express annotate comparable numbers of genes, DAVID provides by far the most annotations per gene. However, many of DAVID's annotations appear to be redundant or are at very high levels in the GO hierarchy. The GOSlim distribution of annotations shows that GOstat, Onto-Express and EasyGO provide similar GO distributions to those found in AgBase while annotations from DAVID show a different GOSlim distribution, again probably due to duplication and many non-specific terms. No consistent trends were found in results of GO term over/under representation analysis applied to the experimental data using different tools. While GOstat, David and Onto-Express could retrieve some significantly enriched terms, EasyGO did not show any significantly enriched terms. There was little agreement about the enriched terms identified by the tools. CONCLUSION: Different tools for functionally annotating gene sets and identifying significantly enriched GO categories differ widely in their results when applied to a test annotation gene set and an experimental dataset from chicken. These results emphasize the need for care when interpreting the results of such analysis and the lack of standardization of approaches.
format Online
Article
Text
id pubmed-3226198
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32261982011-11-30 Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data Berg, Bart HJ van den Thanthiriwatte, Chamali Manda, Prashanti Bridges, Susan M BMC Bioinformatics Proceedings The widespread availability of microarray technology has driven functional genomics to the forefront as scientists seek to draw meaningful biological conclusions from their microarray results. Gene annotation enrichment analysis is a functional analysis technique that has gained widespread attention and for which many tools have been developed. Unfortunately, most of these tools have limited support for agricultural species. Here, we evaluate and compare four publicly available computational tools (Onto-Express, EasyGO, GOstat, and DAVID) that support analysis of gene expression datasets in agricultural species. We use AgBase as the functional annotation reference for agricultural species. The selected tools were evaluated based on i) available features, usage and accessibility, ii) implemented statistical computational methods, and iii) annotation and enrichment performance analysis. Annotation was assessed using a randomly selected test gene annotation set and an experimental differentially expressed gene-set – both from chicken. The experimental set was also used to evaluate identification of enriched functional groups. Comparison of the tools shows that they produce different sets of annotations for the two datasets and different functional groups for the experimental dataset. While DAVID, GOstat and Onto-Express annotate comparable numbers of genes, DAVID provides by far the most annotations per gene. However, many of DAVID's annotations appear to be redundant or are at very high levels in the GO hierarchy. The GOSlim distribution of annotations shows that GOstat, Onto-Express and EasyGO provide similar GO distributions to those found in AgBase while annotations from DAVID show a different GOSlim distribution, again probably due to duplication and many non-specific terms. No consistent trends were found in results of GO term over/under representation analysis applied to the experimental data using different tools. While GOstat, David and Onto-Express could retrieve some significantly enriched terms, EasyGO did not show any significantly enriched terms. There was little agreement about the enriched terms identified by the tools. CONCLUSION: Different tools for functionally annotating gene sets and identifying significantly enriched GO categories differ widely in their results when applied to a test annotation gene set and an experimental dataset from chicken. These results emphasize the need for care when interpreting the results of such analysis and the lack of standardization of approaches. BioMed Central 2009-10-08 /pmc/articles/PMC3226198/ /pubmed/19811693 http://dx.doi.org/10.1186/1471-2105-10-S11-S9 Text en Copyright ©2009 Berg et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Berg, Bart HJ van den
Thanthiriwatte, Chamali
Manda, Prashanti
Bridges, Susan M
Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data
title Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data
title_full Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data
title_fullStr Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data
title_full_unstemmed Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data
title_short Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data
title_sort comparing gene annotation enrichment tools for functional modeling of agricultural microarray data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226198/
https://www.ncbi.nlm.nih.gov/pubmed/19811693
http://dx.doi.org/10.1186/1471-2105-10-S11-S9
work_keys_str_mv AT bergbarthjvanden comparinggeneannotationenrichmenttoolsforfunctionalmodelingofagriculturalmicroarraydata
AT thanthiriwattechamali comparinggeneannotationenrichmenttoolsforfunctionalmodelingofagriculturalmicroarraydata
AT mandaprashanti comparinggeneannotationenrichmenttoolsforfunctionalmodelingofagriculturalmicroarraydata
AT bridgessusanm comparinggeneannotationenrichmenttoolsforfunctionalmodelingofagriculturalmicroarraydata