Cargando…
Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data
BACKGROUND: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assem...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3144025/ https://www.ncbi.nlm.nih.gov/pubmed/21749710 http://dx.doi.org/10.1186/1471-2105-12-282 |
_version_ | 1782208969966616576 |
---|---|
author | Lopez, David Casero, David Cokus, Shawn J Merchant, Sabeeha S Pellegrini, Matteo |
author_facet | Lopez, David Casero, David Cokus, Shawn J Merchant, Sabeeha S Pellegrini, Matteo |
author_sort | Lopez, David |
collection | PubMed |
description | BACKGROUND: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. DESCRIPTION: The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of genes on KEGG pathway maps and batch gene identifier conversion. CONCLUSIONS: The Algal Functional Annotation Tool aims to provide an integrated data-mining environment for algal genomics by combining data from multiple annotation databases into a centralized tool. This site is designed to expedite the process of functional annotation and the interpretation of gene lists, such as those derived from high-throughput RNA-seq experiments. The tool is publicly available at http://pathways.mcdb.ucla.edu. |
format | Online Article Text |
id | pubmed-3144025 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31440252011-07-27 Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data Lopez, David Casero, David Cokus, Shawn J Merchant, Sabeeha S Pellegrini, Matteo BMC Bioinformatics Database BACKGROUND: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. DESCRIPTION: The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of genes on KEGG pathway maps and batch gene identifier conversion. CONCLUSIONS: The Algal Functional Annotation Tool aims to provide an integrated data-mining environment for algal genomics by combining data from multiple annotation databases into a centralized tool. This site is designed to expedite the process of functional annotation and the interpretation of gene lists, such as those derived from high-throughput RNA-seq experiments. The tool is publicly available at http://pathways.mcdb.ucla.edu. BioMed Central 2011-07-12 /pmc/articles/PMC3144025/ /pubmed/21749710 http://dx.doi.org/10.1186/1471-2105-12-282 Text en Copyright ©2011 Lopez et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Lopez, David Casero, David Cokus, Shawn J Merchant, Sabeeha S Pellegrini, Matteo Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data |
title | Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data |
title_full | Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data |
title_fullStr | Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data |
title_full_unstemmed | Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data |
title_short | Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data |
title_sort | algal functional annotation tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3144025/ https://www.ncbi.nlm.nih.gov/pubmed/21749710 http://dx.doi.org/10.1186/1471-2105-12-282 |
work_keys_str_mv | AT lopezdavid algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata AT caserodavid algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata AT cokusshawnj algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata AT merchantsabeehas algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata AT pellegrinimatteo algalfunctionalannotationtoolawebbasedanalysissuitetofunctionallyinterpretlargegenelistsusingintegratedannotationandexpressiondata |