Cargando…

Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data

BACKGROUND: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assem...

Descripción completa

Detalles Bibliográficos
Autores principales: Lopez, David, Casero, David, Cokus, Shawn J, Merchant, Sabeeha S, Pellegrini, Matteo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3144025/
https://www.ncbi.nlm.nih.gov/pubmed/21749710
http://dx.doi.org/10.1186/1471-2105-12-282
_version_ 1782208969966616576
author Lopez, David
Casero, David
Cokus, Shawn J
Merchant, Sabeeha S
Pellegrini, Matteo
author_facet Lopez, David
Casero, David
Cokus, Shawn J
Merchant, Sabeeha S
Pellegrini, Matteo
author_sort Lopez, David
collection PubMed
description BACKGROUND: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. DESCRIPTION: The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of genes on KEGG pathway maps and batch gene identifier conversion. CONCLUSIONS: The Algal Functional Annotation Tool aims to provide an integrated data-mining environment for algal genomics by combining data from multiple annotation databases into a centralized tool. This site is designed to expedite the process of functional annotation and the interpretation of gene lists, such as those derived from high-throughput RNA-seq experiments. The tool is publicly available at http://pathways.mcdb.ucla.edu.
format Online
Article
Text
id pubmed-3144025
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31440252011-07-27 Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data Lopez, David Casero, David Cokus, Shawn J Merchant, Sabeeha S Pellegrini, Matteo BMC Bioinformatics Database BACKGROUND: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. DESCRIPTION: The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of genes on KEGG pathway maps and batch gene identifier conversion. CONCLUSIONS: The Algal Functional Annotation Tool aims to provide an integrated data-mining environment for algal genomics by combining data from multiple annotation databases into a centralized tool. This site is designed to expedite the process of functional annotation and the interpretation of gene lists, such as those derived from high-throughput RNA-seq experiments. The tool is publicly available at http://pathways.mcdb.ucla.edu. BioMed Central 2011-07-12 /pmc/articles/PMC3144025/ /pubmed/21749710 http://dx.doi.org/10.1186/1471-2105-12-282 Text en Copyright ©2011 Lopez et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Lopez, David
Casero, David
Cokus, Shawn J
Merchant, Sabeeha S
Pellegrini, Matteo
Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data
title Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data
title_full Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data
title_fullStr Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data
title_full_unstemmed Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data
title_short Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data
title_sort algal functional annotation tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3144025/
https://www.ncbi.nlm.nih.gov/pubmed/21749710
http://dx.doi.org/10.1186/1471-2105-12-282
work_keys_str_mv AT lopezdavid algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata
AT caserodavid algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata
AT cokusshawnj algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata
AT merchantsabeehas algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata
AT pellegrinimatteo algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata