Cargando…
BLANNOTATOR: enhanced homology-based function prediction of bacterial proteins
BACKGROUND: Automated function prediction has played a central role in determining the biological functions of bacterial proteins. Typically, protein function annotation relies on homology, and function is inferred from other proteins with similar sequences. This approach has become popular in bacte...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3386020/ https://www.ncbi.nlm.nih.gov/pubmed/22335941 http://dx.doi.org/10.1186/1471-2105-13-33 |
_version_ | 1782236933438570496 |
---|---|
author | Kankainen, Matti Ojala, Teija Holm, Liisa |
author_facet | Kankainen, Matti Ojala, Teija Holm, Liisa |
author_sort | Kankainen, Matti |
collection | PubMed |
description | BACKGROUND: Automated function prediction has played a central role in determining the biological functions of bacterial proteins. Typically, protein function annotation relies on homology, and function is inferred from other proteins with similar sequences. This approach has become popular in bacterial genomics because it is one of the few methods that is practical for large datasets and because it does not require additional functional genomics experiments. However, the existing solutions produce erroneous predictions in many cases, especially when query sequences have low levels of identity with the annotated source protein. This problem has created a pressing need for improvements in homology-based annotation. RESULTS: We present an automated method for the functional annotation of bacterial protein sequences. Based on sequence similarity searches, BLANNOTATOR accurately annotates query sequences with one-line summary descriptions of protein function. It groups sequences identified by BLAST into subsets according to their annotation and bases its prediction on a set of sequences with consistent functional information. We show the results of BLANNOTATOR's performance in sets of bacterial proteins with known functions. We simulated the annotation process for 3090 SWISS-PROT proteins using a database in its state preceding the functional characterisation of the query protein. For this dataset, our method outperformed the five others that we tested, and the improved performance was maintained even in the absence of highly related sequence hits. We further demonstrate the value of our tool by analysing the putative proteome of Lactobacillus crispatus strain ST1. CONCLUSIONS: BLANNOTATOR is an accurate method for bacterial protein function prediction. It is practical for genome-scale data and does not require pre-existing sequence clustering; thus, this method suits the needs of bacterial genome and metagenome researchers. The method and a web-server are available at http://ekhidna.biocenter.helsinki.fi/poxo/blannotator/. |
format | Online Article Text |
id | pubmed-3386020 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-33860202012-06-29 BLANNOTATOR: enhanced homology-based function prediction of bacterial proteins Kankainen, Matti Ojala, Teija Holm, Liisa BMC Bioinformatics Methodology Article BACKGROUND: Automated function prediction has played a central role in determining the biological functions of bacterial proteins. Typically, protein function annotation relies on homology, and function is inferred from other proteins with similar sequences. This approach has become popular in bacterial genomics because it is one of the few methods that is practical for large datasets and because it does not require additional functional genomics experiments. However, the existing solutions produce erroneous predictions in many cases, especially when query sequences have low levels of identity with the annotated source protein. This problem has created a pressing need for improvements in homology-based annotation. RESULTS: We present an automated method for the functional annotation of bacterial protein sequences. Based on sequence similarity searches, BLANNOTATOR accurately annotates query sequences with one-line summary descriptions of protein function. It groups sequences identified by BLAST into subsets according to their annotation and bases its prediction on a set of sequences with consistent functional information. We show the results of BLANNOTATOR's performance in sets of bacterial proteins with known functions. We simulated the annotation process for 3090 SWISS-PROT proteins using a database in its state preceding the functional characterisation of the query protein. For this dataset, our method outperformed the five others that we tested, and the improved performance was maintained even in the absence of highly related sequence hits. We further demonstrate the value of our tool by analysing the putative proteome of Lactobacillus crispatus strain ST1. CONCLUSIONS: BLANNOTATOR is an accurate method for bacterial protein function prediction. It is practical for genome-scale data and does not require pre-existing sequence clustering; thus, this method suits the needs of bacterial genome and metagenome researchers. The method and a web-server are available at http://ekhidna.biocenter.helsinki.fi/poxo/blannotator/. BioMed Central 2012-02-15 /pmc/articles/PMC3386020/ /pubmed/22335941 http://dx.doi.org/10.1186/1471-2105-13-33 Text en Copyright ©2012 Kankainen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Kankainen, Matti Ojala, Teija Holm, Liisa BLANNOTATOR: enhanced homology-based function prediction of bacterial proteins |
title | BLANNOTATOR: enhanced homology-based function prediction of bacterial proteins |
title_full | BLANNOTATOR: enhanced homology-based function prediction of bacterial proteins |
title_fullStr | BLANNOTATOR: enhanced homology-based function prediction of bacterial proteins |
title_full_unstemmed | BLANNOTATOR: enhanced homology-based function prediction of bacterial proteins |
title_short | BLANNOTATOR: enhanced homology-based function prediction of bacterial proteins |
title_sort | blannotator: enhanced homology-based function prediction of bacterial proteins |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3386020/ https://www.ncbi.nlm.nih.gov/pubmed/22335941 http://dx.doi.org/10.1186/1471-2105-13-33 |
work_keys_str_mv | AT kankainenmatti blannotatorenhancedhomologybasedfunctionpredictionofbacterialproteins AT ojalateija blannotatorenhancedhomologybasedfunctionpredictionofbacterialproteins AT holmliisa blannotatorenhancedhomologybasedfunctionpredictionofbacterialproteins |