Cargando…

Large-scale identification of human protein function using topological features of interaction network

The annotation of protein function is a vital step to elucidate the essence of life at a molecular level, and it is also meritorious in biomedical and pharmaceutical industry. Developments of sequencing technology result in constant expansion of the gap between the number of the known sequences and...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zhanchao, Liu, Zhiqing, Zhong, Wenqian, Huang, Menghua, Wu, Na, Xie, Yun, Dai, Zong, Zou, Xiaoyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5111120/
https://www.ncbi.nlm.nih.gov/pubmed/27849060
http://dx.doi.org/10.1038/srep37179
_version_ 1782467806895276032
author Li, Zhanchao
Liu, Zhiqing
Zhong, Wenqian
Huang, Menghua
Wu, Na
Xie, Yun
Dai, Zong
Zou, Xiaoyong
author_facet Li, Zhanchao
Liu, Zhiqing
Zhong, Wenqian
Huang, Menghua
Wu, Na
Xie, Yun
Dai, Zong
Zou, Xiaoyong
author_sort Li, Zhanchao
collection PubMed
description The annotation of protein function is a vital step to elucidate the essence of life at a molecular level, and it is also meritorious in biomedical and pharmaceutical industry. Developments of sequencing technology result in constant expansion of the gap between the number of the known sequences and their functions. Therefore, it is indispensable to develop a computational method for the annotation of protein function. Herein, a novel method is proposed to identify protein function based on the weighted human protein-protein interaction network and graph theory. The network topology features with local and global information are presented to characterise proteins. The minimum redundancy maximum relevance algorithm is used to select 227 optimized feature subsets and support vector machine technique is utilized to build the prediction models. The performance of current method is assessed through 10-fold cross-validation test, and the range of accuracies is from 67.63% to 100%. Comparing with other annotation methods, the proposed way possesses a 50% improvement in the predictive accuracy. Generally, such network topology features provide insights into the relationship between protein functions and network architectures. The source code of Matlab is freely available on request from the authors.
format Online
Article
Text
id pubmed-5111120
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-51111202016-11-23 Large-scale identification of human protein function using topological features of interaction network Li, Zhanchao Liu, Zhiqing Zhong, Wenqian Huang, Menghua Wu, Na Xie, Yun Dai, Zong Zou, Xiaoyong Sci Rep Article The annotation of protein function is a vital step to elucidate the essence of life at a molecular level, and it is also meritorious in biomedical and pharmaceutical industry. Developments of sequencing technology result in constant expansion of the gap between the number of the known sequences and their functions. Therefore, it is indispensable to develop a computational method for the annotation of protein function. Herein, a novel method is proposed to identify protein function based on the weighted human protein-protein interaction network and graph theory. The network topology features with local and global information are presented to characterise proteins. The minimum redundancy maximum relevance algorithm is used to select 227 optimized feature subsets and support vector machine technique is utilized to build the prediction models. The performance of current method is assessed through 10-fold cross-validation test, and the range of accuracies is from 67.63% to 100%. Comparing with other annotation methods, the proposed way possesses a 50% improvement in the predictive accuracy. Generally, such network topology features provide insights into the relationship between protein functions and network architectures. The source code of Matlab is freely available on request from the authors. Nature Publishing Group 2016-11-16 /pmc/articles/PMC5111120/ /pubmed/27849060 http://dx.doi.org/10.1038/srep37179 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Li, Zhanchao
Liu, Zhiqing
Zhong, Wenqian
Huang, Menghua
Wu, Na
Xie, Yun
Dai, Zong
Zou, Xiaoyong
Large-scale identification of human protein function using topological features of interaction network
title Large-scale identification of human protein function using topological features of interaction network
title_full Large-scale identification of human protein function using topological features of interaction network
title_fullStr Large-scale identification of human protein function using topological features of interaction network
title_full_unstemmed Large-scale identification of human protein function using topological features of interaction network
title_short Large-scale identification of human protein function using topological features of interaction network
title_sort large-scale identification of human protein function using topological features of interaction network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5111120/
https://www.ncbi.nlm.nih.gov/pubmed/27849060
http://dx.doi.org/10.1038/srep37179
work_keys_str_mv AT lizhanchao largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork
AT liuzhiqing largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork
AT zhongwenqian largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork
AT huangmenghua largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork
AT wuna largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork
AT xieyun largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork
AT daizong largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork
AT zouxiaoyong largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork