Cargando…

A novel function prediction approach using protein overlap networks

BACKGROUND: Construction of a reliable network remains the bottleneck for network-based protein function prediction. We built an artificial network model called protein overlap network (PON) for the entire genome of yeast, fly, worm, and human, respectively. Each node of the network represents a pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Shide, Zheng, Dandan, Standley, Daron M, Guo, Huarong, Zhang, Chi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3720179/
https://www.ncbi.nlm.nih.gov/pubmed/23866986
http://dx.doi.org/10.1186/1752-0509-7-61
_version_ 1782277930948231168
author Liang, Shide
Zheng, Dandan
Standley, Daron M
Guo, Huarong
Zhang, Chi
author_facet Liang, Shide
Zheng, Dandan
Standley, Daron M
Guo, Huarong
Zhang, Chi
author_sort Liang, Shide
collection PubMed
description BACKGROUND: Construction of a reliable network remains the bottleneck for network-based protein function prediction. We built an artificial network model called protein overlap network (PON) for the entire genome of yeast, fly, worm, and human, respectively. Each node of the network represents a protein, and two proteins are connected if they share a domain according to InterPro database. RESULTS: The function of a protein can be predicted by counting the occurrence frequency of GO (gene ontology) terms associated with domains of direct neighbors. The average success rate and coverage were 34.3% and 43.9%, respectively, for the test genomes, and were increased to 37.9% and 51.3% when a composite PON of the four species was used for the prediction. As a comparison, the success rate was 7.0% in the random control procedure. We also made predictions with GO term annotations of the second layer nodes using the composite network and obtained an impressive success rate (>30%) and coverage (>30%), even for small genomes. Further improvement was achieved by statistical analysis of manually annotated GO terms for each neighboring protein. CONCLUSIONS: The PONs are composed of dense modules accompanied by a few long distance connections. Based on the PONs, we developed multiple approaches effective for protein function prediction.
format Online
Article
Text
id pubmed-3720179
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37201792013-07-26 A novel function prediction approach using protein overlap networks Liang, Shide Zheng, Dandan Standley, Daron M Guo, Huarong Zhang, Chi BMC Syst Biol Research Article BACKGROUND: Construction of a reliable network remains the bottleneck for network-based protein function prediction. We built an artificial network model called protein overlap network (PON) for the entire genome of yeast, fly, worm, and human, respectively. Each node of the network represents a protein, and two proteins are connected if they share a domain according to InterPro database. RESULTS: The function of a protein can be predicted by counting the occurrence frequency of GO (gene ontology) terms associated with domains of direct neighbors. The average success rate and coverage were 34.3% and 43.9%, respectively, for the test genomes, and were increased to 37.9% and 51.3% when a composite PON of the four species was used for the prediction. As a comparison, the success rate was 7.0% in the random control procedure. We also made predictions with GO term annotations of the second layer nodes using the composite network and obtained an impressive success rate (>30%) and coverage (>30%), even for small genomes. Further improvement was achieved by statistical analysis of manually annotated GO terms for each neighboring protein. CONCLUSIONS: The PONs are composed of dense modules accompanied by a few long distance connections. Based on the PONs, we developed multiple approaches effective for protein function prediction. BioMed Central 2013-07-17 /pmc/articles/PMC3720179/ /pubmed/23866986 http://dx.doi.org/10.1186/1752-0509-7-61 Text en Copyright © 2013 Liang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Liang, Shide
Zheng, Dandan
Standley, Daron M
Guo, Huarong
Zhang, Chi
A novel function prediction approach using protein overlap networks
title A novel function prediction approach using protein overlap networks
title_full A novel function prediction approach using protein overlap networks
title_fullStr A novel function prediction approach using protein overlap networks
title_full_unstemmed A novel function prediction approach using protein overlap networks
title_short A novel function prediction approach using protein overlap networks
title_sort novel function prediction approach using protein overlap networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3720179/
https://www.ncbi.nlm.nih.gov/pubmed/23866986
http://dx.doi.org/10.1186/1752-0509-7-61
work_keys_str_mv AT liangshide anovelfunctionpredictionapproachusingproteinoverlapnetworks
AT zhengdandan anovelfunctionpredictionapproachusingproteinoverlapnetworks
AT standleydaronm anovelfunctionpredictionapproachusingproteinoverlapnetworks
AT guohuarong anovelfunctionpredictionapproachusingproteinoverlapnetworks
AT zhangchi anovelfunctionpredictionapproachusingproteinoverlapnetworks
AT liangshide novelfunctionpredictionapproachusingproteinoverlapnetworks
AT zhengdandan novelfunctionpredictionapproachusingproteinoverlapnetworks
AT standleydaronm novelfunctionpredictionapproachusingproteinoverlapnetworks
AT guohuarong novelfunctionpredictionapproachusingproteinoverlapnetworks
AT zhangchi novelfunctionpredictionapproachusingproteinoverlapnetworks