Cargando…
Hierarchical Ensemble Methods for Protein Function Prediction
Protein function prediction is a complex multiclass multilabel classification problem, characterized by multiple issues such as the incompleteness of the available annotations, the integration of multiple sources of high dimensional biomolecular data, the unbalance of several functional classes, and...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393075/ https://www.ncbi.nlm.nih.gov/pubmed/25937954 http://dx.doi.org/10.1155/2014/901419 |
_version_ | 1782366095733161984 |
---|---|
author | Valentini, Giorgio |
author_facet | Valentini, Giorgio |
author_sort | Valentini, Giorgio |
collection | PubMed |
description | Protein function prediction is a complex multiclass multilabel classification problem, characterized by multiple issues such as the incompleteness of the available annotations, the integration of multiple sources of high dimensional biomolecular data, the unbalance of several functional classes, and the difficulty of univocally determining negative examples. Moreover, the hierarchical relationships between functional classes that characterize both the Gene Ontology and FunCat taxonomies motivate the development of hierarchy-aware prediction methods that showed significantly better performances than hierarchical-unaware “flat” prediction methods. In this paper, we provide a comprehensive review of hierarchical methods for protein function prediction based on ensembles of learning machines. According to this general approach, a separate learning machine is trained to learn a specific functional term and then the resulting predictions are assembled in a “consensus” ensemble decision, taking into account the hierarchical relationships between classes. The main hierarchical ensemble methods proposed in the literature are discussed in the context of existing computational methods for protein function prediction, highlighting their characteristics, advantages, and limitations. Open problems of this exciting research area of computational biology are finally considered, outlining novel perspectives for future research. |
format | Online Article Text |
id | pubmed-4393075 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-43930752015-05-03 Hierarchical Ensemble Methods for Protein Function Prediction Valentini, Giorgio ISRN Bioinform Review Article Protein function prediction is a complex multiclass multilabel classification problem, characterized by multiple issues such as the incompleteness of the available annotations, the integration of multiple sources of high dimensional biomolecular data, the unbalance of several functional classes, and the difficulty of univocally determining negative examples. Moreover, the hierarchical relationships between functional classes that characterize both the Gene Ontology and FunCat taxonomies motivate the development of hierarchy-aware prediction methods that showed significantly better performances than hierarchical-unaware “flat” prediction methods. In this paper, we provide a comprehensive review of hierarchical methods for protein function prediction based on ensembles of learning machines. According to this general approach, a separate learning machine is trained to learn a specific functional term and then the resulting predictions are assembled in a “consensus” ensemble decision, taking into account the hierarchical relationships between classes. The main hierarchical ensemble methods proposed in the literature are discussed in the context of existing computational methods for protein function prediction, highlighting their characteristics, advantages, and limitations. Open problems of this exciting research area of computational biology are finally considered, outlining novel perspectives for future research. Hindawi Publishing Corporation 2014-05-04 /pmc/articles/PMC4393075/ /pubmed/25937954 http://dx.doi.org/10.1155/2014/901419 Text en Copyright © 2014 Giorgio Valentini. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Review Article Valentini, Giorgio Hierarchical Ensemble Methods for Protein Function Prediction |
title | Hierarchical Ensemble Methods for Protein Function Prediction |
title_full | Hierarchical Ensemble Methods for Protein Function Prediction |
title_fullStr | Hierarchical Ensemble Methods for Protein Function Prediction |
title_full_unstemmed | Hierarchical Ensemble Methods for Protein Function Prediction |
title_short | Hierarchical Ensemble Methods for Protein Function Prediction |
title_sort | hierarchical ensemble methods for protein function prediction |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393075/ https://www.ncbi.nlm.nih.gov/pubmed/25937954 http://dx.doi.org/10.1155/2014/901419 |
work_keys_str_mv | AT valentinigiorgio hierarchicalensemblemethodsforproteinfunctionprediction |