Cargando…

NetGO: improving large-scale protein function prediction with massive network information

Automated function prediction (AFP) of proteins is of great significance in biology. AFP can be regarded as a problem of the large-scale multi-label classification where a protein can be associated with multiple gene ontology terms as its labels. Based on our GOLabeler—a state-of-the-art method for...

Descripción completa

Detalles Bibliográficos
Autores principales: You, Ronghui, Yao, Shuwei, Xiong, Yi, Huang, Xiaodi, Sun, Fengzhu, Mamitsuka, Hiroshi, Zhu, Shanfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6602452/
https://www.ncbi.nlm.nih.gov/pubmed/31106361
http://dx.doi.org/10.1093/nar/gkz388
_version_ 1783431380360757248
author You, Ronghui
Yao, Shuwei
Xiong, Yi
Huang, Xiaodi
Sun, Fengzhu
Mamitsuka, Hiroshi
Zhu, Shanfeng
author_facet You, Ronghui
Yao, Shuwei
Xiong, Yi
Huang, Xiaodi
Sun, Fengzhu
Mamitsuka, Hiroshi
Zhu, Shanfeng
author_sort You, Ronghui
collection PubMed
description Automated function prediction (AFP) of proteins is of great significance in biology. AFP can be regarded as a problem of the large-scale multi-label classification where a protein can be associated with multiple gene ontology terms as its labels. Based on our GOLabeler—a state-of-the-art method for the third critical assessment of functional annotation (CAFA3), in this paper we propose NetGO, a web server that is able to further improve the performance of the large-scale AFP by incorporating massive protein-protein network information. Specifically, the advantages of NetGO are threefold in using network information: (i) NetGO relies on a powerful learning to rank framework from machine learning to effectively integrate both sequence and network information of proteins; (ii) NetGO uses the massive network information of all species (>2000) in STRING (other than only some specific species) and (iii) NetGO still can use network information to annotate a protein by homology transfer, even if it is not contained in STRING. Separating training and testing data with the same time-delayed settings of CAFA, we comprehensively examined the performance of NetGO. Experimental results have clearly demonstrated that NetGO significantly outperforms GOLabeler and other competing methods. The NetGO web server is freely available at http://issubmission.sjtu.edu.cn/netgo/.
format Online
Article
Text
id pubmed-6602452
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66024522019-07-05 NetGO: improving large-scale protein function prediction with massive network information You, Ronghui Yao, Shuwei Xiong, Yi Huang, Xiaodi Sun, Fengzhu Mamitsuka, Hiroshi Zhu, Shanfeng Nucleic Acids Res Web Server Issue Automated function prediction (AFP) of proteins is of great significance in biology. AFP can be regarded as a problem of the large-scale multi-label classification where a protein can be associated with multiple gene ontology terms as its labels. Based on our GOLabeler—a state-of-the-art method for the third critical assessment of functional annotation (CAFA3), in this paper we propose NetGO, a web server that is able to further improve the performance of the large-scale AFP by incorporating massive protein-protein network information. Specifically, the advantages of NetGO are threefold in using network information: (i) NetGO relies on a powerful learning to rank framework from machine learning to effectively integrate both sequence and network information of proteins; (ii) NetGO uses the massive network information of all species (>2000) in STRING (other than only some specific species) and (iii) NetGO still can use network information to annotate a protein by homology transfer, even if it is not contained in STRING. Separating training and testing data with the same time-delayed settings of CAFA, we comprehensively examined the performance of NetGO. Experimental results have clearly demonstrated that NetGO significantly outperforms GOLabeler and other competing methods. The NetGO web server is freely available at http://issubmission.sjtu.edu.cn/netgo/. Oxford University Press 2019-07-02 2019-05-20 /pmc/articles/PMC6602452/ /pubmed/31106361 http://dx.doi.org/10.1093/nar/gkz388 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Web Server Issue
You, Ronghui
Yao, Shuwei
Xiong, Yi
Huang, Xiaodi
Sun, Fengzhu
Mamitsuka, Hiroshi
Zhu, Shanfeng
NetGO: improving large-scale protein function prediction with massive network information
title NetGO: improving large-scale protein function prediction with massive network information
title_full NetGO: improving large-scale protein function prediction with massive network information
title_fullStr NetGO: improving large-scale protein function prediction with massive network information
title_full_unstemmed NetGO: improving large-scale protein function prediction with massive network information
title_short NetGO: improving large-scale protein function prediction with massive network information
title_sort netgo: improving large-scale protein function prediction with massive network information
topic Web Server Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6602452/
https://www.ncbi.nlm.nih.gov/pubmed/31106361
http://dx.doi.org/10.1093/nar/gkz388
work_keys_str_mv AT youronghui netgoimprovinglargescaleproteinfunctionpredictionwithmassivenetworkinformation
AT yaoshuwei netgoimprovinglargescaleproteinfunctionpredictionwithmassivenetworkinformation
AT xiongyi netgoimprovinglargescaleproteinfunctionpredictionwithmassivenetworkinformation
AT huangxiaodi netgoimprovinglargescaleproteinfunctionpredictionwithmassivenetworkinformation
AT sunfengzhu netgoimprovinglargescaleproteinfunctionpredictionwithmassivenetworkinformation
AT mamitsukahiroshi netgoimprovinglargescaleproteinfunctionpredictionwithmassivenetworkinformation
AT zhushanfeng netgoimprovinglargescaleproteinfunctionpredictionwithmassivenetworkinformation