Cargando…
A novel function prediction approach using protein overlap networks
BACKGROUND: Construction of a reliable network remains the bottleneck for network-based protein function prediction. We built an artificial network model called protein overlap network (PON) for the entire genome of yeast, fly, worm, and human, respectively. Each node of the network represents a pro...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3720179/ https://www.ncbi.nlm.nih.gov/pubmed/23866986 http://dx.doi.org/10.1186/1752-0509-7-61 |
_version_ | 1782277930948231168 |
---|---|
author | Liang, Shide Zheng, Dandan Standley, Daron M Guo, Huarong Zhang, Chi |
author_facet | Liang, Shide Zheng, Dandan Standley, Daron M Guo, Huarong Zhang, Chi |
author_sort | Liang, Shide |
collection | PubMed |
description | BACKGROUND: Construction of a reliable network remains the bottleneck for network-based protein function prediction. We built an artificial network model called protein overlap network (PON) for the entire genome of yeast, fly, worm, and human, respectively. Each node of the network represents a protein, and two proteins are connected if they share a domain according to InterPro database. RESULTS: The function of a protein can be predicted by counting the occurrence frequency of GO (gene ontology) terms associated with domains of direct neighbors. The average success rate and coverage were 34.3% and 43.9%, respectively, for the test genomes, and were increased to 37.9% and 51.3% when a composite PON of the four species was used for the prediction. As a comparison, the success rate was 7.0% in the random control procedure. We also made predictions with GO term annotations of the second layer nodes using the composite network and obtained an impressive success rate (>30%) and coverage (>30%), even for small genomes. Further improvement was achieved by statistical analysis of manually annotated GO terms for each neighboring protein. CONCLUSIONS: The PONs are composed of dense modules accompanied by a few long distance connections. Based on the PONs, we developed multiple approaches effective for protein function prediction. |
format | Online Article Text |
id | pubmed-3720179 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-37201792013-07-26 A novel function prediction approach using protein overlap networks Liang, Shide Zheng, Dandan Standley, Daron M Guo, Huarong Zhang, Chi BMC Syst Biol Research Article BACKGROUND: Construction of a reliable network remains the bottleneck for network-based protein function prediction. We built an artificial network model called protein overlap network (PON) for the entire genome of yeast, fly, worm, and human, respectively. Each node of the network represents a protein, and two proteins are connected if they share a domain according to InterPro database. RESULTS: The function of a protein can be predicted by counting the occurrence frequency of GO (gene ontology) terms associated with domains of direct neighbors. The average success rate and coverage were 34.3% and 43.9%, respectively, for the test genomes, and were increased to 37.9% and 51.3% when a composite PON of the four species was used for the prediction. As a comparison, the success rate was 7.0% in the random control procedure. We also made predictions with GO term annotations of the second layer nodes using the composite network and obtained an impressive success rate (>30%) and coverage (>30%), even for small genomes. Further improvement was achieved by statistical analysis of manually annotated GO terms for each neighboring protein. CONCLUSIONS: The PONs are composed of dense modules accompanied by a few long distance connections. Based on the PONs, we developed multiple approaches effective for protein function prediction. BioMed Central 2013-07-17 /pmc/articles/PMC3720179/ /pubmed/23866986 http://dx.doi.org/10.1186/1752-0509-7-61 Text en Copyright © 2013 Liang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Liang, Shide Zheng, Dandan Standley, Daron M Guo, Huarong Zhang, Chi A novel function prediction approach using protein overlap networks |
title | A novel function prediction approach using protein overlap networks |
title_full | A novel function prediction approach using protein overlap networks |
title_fullStr | A novel function prediction approach using protein overlap networks |
title_full_unstemmed | A novel function prediction approach using protein overlap networks |
title_short | A novel function prediction approach using protein overlap networks |
title_sort | novel function prediction approach using protein overlap networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3720179/ https://www.ncbi.nlm.nih.gov/pubmed/23866986 http://dx.doi.org/10.1186/1752-0509-7-61 |
work_keys_str_mv | AT liangshide anovelfunctionpredictionapproachusingproteinoverlapnetworks AT zhengdandan anovelfunctionpredictionapproachusingproteinoverlapnetworks AT standleydaronm anovelfunctionpredictionapproachusingproteinoverlapnetworks AT guohuarong anovelfunctionpredictionapproachusingproteinoverlapnetworks AT zhangchi anovelfunctionpredictionapproachusingproteinoverlapnetworks AT liangshide novelfunctionpredictionapproachusingproteinoverlapnetworks AT zhengdandan novelfunctionpredictionapproachusingproteinoverlapnetworks AT standleydaronm novelfunctionpredictionapproachusingproteinoverlapnetworks AT guohuarong novelfunctionpredictionapproachusingproteinoverlapnetworks AT zhangchi novelfunctionpredictionapproachusingproteinoverlapnetworks |