Cargando…
Large-scale identification of human protein function using topological features of interaction network
The annotation of protein function is a vital step to elucidate the essence of life at a molecular level, and it is also meritorious in biomedical and pharmaceutical industry. Developments of sequencing technology result in constant expansion of the gap between the number of the known sequences and...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5111120/ https://www.ncbi.nlm.nih.gov/pubmed/27849060 http://dx.doi.org/10.1038/srep37179 |
_version_ | 1782467806895276032 |
---|---|
author | Li, Zhanchao Liu, Zhiqing Zhong, Wenqian Huang, Menghua Wu, Na Xie, Yun Dai, Zong Zou, Xiaoyong |
author_facet | Li, Zhanchao Liu, Zhiqing Zhong, Wenqian Huang, Menghua Wu, Na Xie, Yun Dai, Zong Zou, Xiaoyong |
author_sort | Li, Zhanchao |
collection | PubMed |
description | The annotation of protein function is a vital step to elucidate the essence of life at a molecular level, and it is also meritorious in biomedical and pharmaceutical industry. Developments of sequencing technology result in constant expansion of the gap between the number of the known sequences and their functions. Therefore, it is indispensable to develop a computational method for the annotation of protein function. Herein, a novel method is proposed to identify protein function based on the weighted human protein-protein interaction network and graph theory. The network topology features with local and global information are presented to characterise proteins. The minimum redundancy maximum relevance algorithm is used to select 227 optimized feature subsets and support vector machine technique is utilized to build the prediction models. The performance of current method is assessed through 10-fold cross-validation test, and the range of accuracies is from 67.63% to 100%. Comparing with other annotation methods, the proposed way possesses a 50% improvement in the predictive accuracy. Generally, such network topology features provide insights into the relationship between protein functions and network architectures. The source code of Matlab is freely available on request from the authors. |
format | Online Article Text |
id | pubmed-5111120 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-51111202016-11-23 Large-scale identification of human protein function using topological features of interaction network Li, Zhanchao Liu, Zhiqing Zhong, Wenqian Huang, Menghua Wu, Na Xie, Yun Dai, Zong Zou, Xiaoyong Sci Rep Article The annotation of protein function is a vital step to elucidate the essence of life at a molecular level, and it is also meritorious in biomedical and pharmaceutical industry. Developments of sequencing technology result in constant expansion of the gap between the number of the known sequences and their functions. Therefore, it is indispensable to develop a computational method for the annotation of protein function. Herein, a novel method is proposed to identify protein function based on the weighted human protein-protein interaction network and graph theory. The network topology features with local and global information are presented to characterise proteins. The minimum redundancy maximum relevance algorithm is used to select 227 optimized feature subsets and support vector machine technique is utilized to build the prediction models. The performance of current method is assessed through 10-fold cross-validation test, and the range of accuracies is from 67.63% to 100%. Comparing with other annotation methods, the proposed way possesses a 50% improvement in the predictive accuracy. Generally, such network topology features provide insights into the relationship between protein functions and network architectures. The source code of Matlab is freely available on request from the authors. Nature Publishing Group 2016-11-16 /pmc/articles/PMC5111120/ /pubmed/27849060 http://dx.doi.org/10.1038/srep37179 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Li, Zhanchao Liu, Zhiqing Zhong, Wenqian Huang, Menghua Wu, Na Xie, Yun Dai, Zong Zou, Xiaoyong Large-scale identification of human protein function using topological features of interaction network |
title | Large-scale identification of human protein function using topological features of interaction network |
title_full | Large-scale identification of human protein function using topological features of interaction network |
title_fullStr | Large-scale identification of human protein function using topological features of interaction network |
title_full_unstemmed | Large-scale identification of human protein function using topological features of interaction network |
title_short | Large-scale identification of human protein function using topological features of interaction network |
title_sort | large-scale identification of human protein function using topological features of interaction network |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5111120/ https://www.ncbi.nlm.nih.gov/pubmed/27849060 http://dx.doi.org/10.1038/srep37179 |
work_keys_str_mv | AT lizhanchao largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork AT liuzhiqing largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork AT zhongwenqian largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork AT huangmenghua largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork AT wuna largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork AT xieyun largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork AT daizong largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork AT zouxiaoyong largescaleidentificationofhumanproteinfunctionusingtopologicalfeaturesofinteractionnetwork |