Cargando…

Large-scale protein function prediction using heterogeneous ensembles

Heterogeneous ensembles are an effective approach in scenarios where the ideal data type and/or individual predictor are unclear for a given problem. These ensembles have shown promise for protein function prediction (PFP), but their ability to improve PFP at a large scale is unclear. The overall go...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Linhua, Law, Jeffrey, Kale, Shiv D., Murali, T. M., Pandey, Gaurav
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	F1000 Research Limited 2018
Materias:	Method Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6221071/ https://www.ncbi.nlm.nih.gov/pubmed/30450194 http://dx.doi.org/10.12688/f1000research.16415.1

Descripción
Sumario:	Heterogeneous ensembles are an effective approach in scenarios where the ideal data type and/or individual predictor are unclear for a given problem. These ensembles have shown promise for protein function prediction (PFP), but their ability to improve PFP at a large scale is unclear. The overall goal of this study is to critically assess this ability of a variety of heterogeneous ensemble methods across a multitude of functional terms, proteins and organisms. Our results show that these methods, especially Stacking using Logistic Regression, indeed produce more accurate predictions for a variety of Gene Ontology terms differing in size and specificity. To enable the application of these methods to other related problems, we have publicly shared the HPC-enabled code underlying this work as LargeGOPred ( https://github.com/GauravPandeyLab/LargeGOPred).

Large-scale protein function prediction using heterogeneous ensembles

Ejemplares similares