Cargando…

InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes

BACKGROUND: Although many genomic features have been used in the prediction of protein-protein interactions (PPIs), frequently only one is used in a computational method. After realizing the limited power in the prediction using only one genomic feature, investigators are now moving toward integrati...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Jingchun, Sun, Yan, Ding, Guohui, Liu, Qi, Wang, Chuan, He, Youyu, Shi, Tieliu, Li, Yixue, Zhao, Zhongming
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238723/
https://www.ncbi.nlm.nih.gov/pubmed/17963500
http://dx.doi.org/10.1186/1471-2105-8-414
_version_ 1782150450131238912
author Sun, Jingchun
Sun, Yan
Ding, Guohui
Liu, Qi
Wang, Chuan
He, Youyu
Shi, Tieliu
Li, Yixue
Zhao, Zhongming
author_facet Sun, Jingchun
Sun, Yan
Ding, Guohui
Liu, Qi
Wang, Chuan
He, Youyu
Shi, Tieliu
Li, Yixue
Zhao, Zhongming
author_sort Sun, Jingchun
collection PubMed
description BACKGROUND: Although many genomic features have been used in the prediction of protein-protein interactions (PPIs), frequently only one is used in a computational method. After realizing the limited power in the prediction using only one genomic feature, investigators are now moving toward integration. So far, there have been few integration studies for PPI prediction; one failed to yield appreciable improvement of prediction and the others did not conduct performance comparison. It remains unclear whether an integration of multiple genomic features can improve the PPI prediction and, if it can, how to integrate these features. RESULTS: In this study, we first performed a systematic evaluation on the PPI prediction in Escherichia coli (E. coli) by four genomic context based methods: the phylogenetic profile method, the gene cluster method, the gene fusion method, and the gene neighbor method. The number of predicted PPIs and the average degree in the predicted PPI networks varied greatly among the four methods. Further, no method outperformed the others when we tested using three well-defined positive datasets from the KEGG, EcoCyc, and DIP databases. Based on these comparisons, we developed a novel integrated method, named InPrePPI. InPrePPI first normalizes the AC value (an integrated value of the accuracy and coverage) of each method using three positive datasets, then calculates a weight for each method, and finally uses the weight to calculate an integrated score for each protein pair predicted by the four genomic context based methods. We demonstrate that InPrePPI outperforms each of the four individual methods and, in general, the other two existing integrated methods: the joint observation method and the integrated prediction method in STRING. These four methods and InPrePPI are implemented in a user-friendly web interface. CONCLUSION: This study evaluated the PPI prediction by four genomic context based methods, and presents an integrated evaluation method that shows better performance in E. coli.
format Text
id pubmed-2238723
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22387232008-02-12 InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes Sun, Jingchun Sun, Yan Ding, Guohui Liu, Qi Wang, Chuan He, Youyu Shi, Tieliu Li, Yixue Zhao, Zhongming BMC Bioinformatics Methodology Article BACKGROUND: Although many genomic features have been used in the prediction of protein-protein interactions (PPIs), frequently only one is used in a computational method. After realizing the limited power in the prediction using only one genomic feature, investigators are now moving toward integration. So far, there have been few integration studies for PPI prediction; one failed to yield appreciable improvement of prediction and the others did not conduct performance comparison. It remains unclear whether an integration of multiple genomic features can improve the PPI prediction and, if it can, how to integrate these features. RESULTS: In this study, we first performed a systematic evaluation on the PPI prediction in Escherichia coli (E. coli) by four genomic context based methods: the phylogenetic profile method, the gene cluster method, the gene fusion method, and the gene neighbor method. The number of predicted PPIs and the average degree in the predicted PPI networks varied greatly among the four methods. Further, no method outperformed the others when we tested using three well-defined positive datasets from the KEGG, EcoCyc, and DIP databases. Based on these comparisons, we developed a novel integrated method, named InPrePPI. InPrePPI first normalizes the AC value (an integrated value of the accuracy and coverage) of each method using three positive datasets, then calculates a weight for each method, and finally uses the weight to calculate an integrated score for each protein pair predicted by the four genomic context based methods. We demonstrate that InPrePPI outperforms each of the four individual methods and, in general, the other two existing integrated methods: the joint observation method and the integrated prediction method in STRING. These four methods and InPrePPI are implemented in a user-friendly web interface. CONCLUSION: This study evaluated the PPI prediction by four genomic context based methods, and presents an integrated evaluation method that shows better performance in E. coli. BioMed Central 2007-10-26 /pmc/articles/PMC2238723/ /pubmed/17963500 http://dx.doi.org/10.1186/1471-2105-8-414 Text en Copyright © 2007 Sun et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Sun, Jingchun
Sun, Yan
Ding, Guohui
Liu, Qi
Wang, Chuan
He, Youyu
Shi, Tieliu
Li, Yixue
Zhao, Zhongming
InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes
title InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes
title_full InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes
title_fullStr InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes
title_full_unstemmed InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes
title_short InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes
title_sort inpreppi: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238723/
https://www.ncbi.nlm.nih.gov/pubmed/17963500
http://dx.doi.org/10.1186/1471-2105-8-414
work_keys_str_mv AT sunjingchun inpreppianintegratedevaluationmethodbasedongenomiccontextforpredictingproteinproteininteractionsinprokaryoticgenomes
AT sunyan inpreppianintegratedevaluationmethodbasedongenomiccontextforpredictingproteinproteininteractionsinprokaryoticgenomes
AT dingguohui inpreppianintegratedevaluationmethodbasedongenomiccontextforpredictingproteinproteininteractionsinprokaryoticgenomes
AT liuqi inpreppianintegratedevaluationmethodbasedongenomiccontextforpredictingproteinproteininteractionsinprokaryoticgenomes
AT wangchuan inpreppianintegratedevaluationmethodbasedongenomiccontextforpredictingproteinproteininteractionsinprokaryoticgenomes
AT heyouyu inpreppianintegratedevaluationmethodbasedongenomiccontextforpredictingproteinproteininteractionsinprokaryoticgenomes
AT shitieliu inpreppianintegratedevaluationmethodbasedongenomiccontextforpredictingproteinproteininteractionsinprokaryoticgenomes
AT liyixue inpreppianintegratedevaluationmethodbasedongenomiccontextforpredictingproteinproteininteractionsinprokaryoticgenomes
AT zhaozhongming inpreppianintegratedevaluationmethodbasedongenomiccontextforpredictingproteinproteininteractionsinprokaryoticgenomes