Cargando…

Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network

Identifying protein complexes from protein-protein interaction networks (PPINs) is important to understand the science of cellular organization and function. However, PPINs produced by high-throughput studies have high false discovery rate and only represent snapshot interaction information. Reconst...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Bo, Liu, Yu, Lin, Chi, Dong, Jie, Liu, Xiaoxia, He, Zengyou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6067004/
https://www.ncbi.nlm.nih.gov/pubmed/30087694
http://dx.doi.org/10.3389/fgene.2018.00272
_version_ 1783343073576615936
author Xu, Bo
Liu, Yu
Lin, Chi
Dong, Jie
Liu, Xiaoxia
He, Zengyou
author_facet Xu, Bo
Liu, Yu
Lin, Chi
Dong, Jie
Liu, Xiaoxia
He, Zengyou
author_sort Xu, Bo
collection PubMed
description Identifying protein complexes from protein-protein interaction networks (PPINs) is important to understand the science of cellular organization and function. However, PPINs produced by high-throughput studies have high false discovery rate and only represent snapshot interaction information. Reconstructing higher quality PPINs is essential for protein complex identification. Here we present a Multi-Level PPINs reconstruction (MLPR) method for protein complexes detection. From existing PPINs, we generated full combinations of every two proteins. These protein pairs are represented as a vector which includes six different sources. Then the protein pairs with same vector are mapped to the same fingerprint ID. A fingerprint similarity network is constructed next, in which a vertex represents a protein pair fingerprint ID and each vertex is connected to its top 10 similar fingerprints by edges. After random walking on the fingerprints similarity network, each vertex got a score at the steady state. According to the score of protein pairs, we considered the top ranked ones as reliable PPI and the score as the weight of edge between two distinct proteins. Finally, we expanded clusters starting from seeded vertexes based on the new weighted reliable PPINs. Applying our method on the yeast PPINs, our algorithm achieved higher F-value in protein complexes detection than the-state-of-the-art methods. The interactions in our reconstructed PPI network have more significant biological relevance than the exiting PPI datasets, assessed by gene ontology. In addition, the performance of existing popular protein complexes detection methods are significantly improved on our reconstructed network.
format Online
Article
Text
id pubmed-6067004
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-60670042018-08-07 Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network Xu, Bo Liu, Yu Lin, Chi Dong, Jie Liu, Xiaoxia He, Zengyou Front Genet Genetics Identifying protein complexes from protein-protein interaction networks (PPINs) is important to understand the science of cellular organization and function. However, PPINs produced by high-throughput studies have high false discovery rate and only represent snapshot interaction information. Reconstructing higher quality PPINs is essential for protein complex identification. Here we present a Multi-Level PPINs reconstruction (MLPR) method for protein complexes detection. From existing PPINs, we generated full combinations of every two proteins. These protein pairs are represented as a vector which includes six different sources. Then the protein pairs with same vector are mapped to the same fingerprint ID. A fingerprint similarity network is constructed next, in which a vertex represents a protein pair fingerprint ID and each vertex is connected to its top 10 similar fingerprints by edges. After random walking on the fingerprints similarity network, each vertex got a score at the steady state. According to the score of protein pairs, we considered the top ranked ones as reliable PPI and the score as the weight of edge between two distinct proteins. Finally, we expanded clusters starting from seeded vertexes based on the new weighted reliable PPINs. Applying our method on the yeast PPINs, our algorithm achieved higher F-value in protein complexes detection than the-state-of-the-art methods. The interactions in our reconstructed PPI network have more significant biological relevance than the exiting PPI datasets, assessed by gene ontology. In addition, the performance of existing popular protein complexes detection methods are significantly improved on our reconstructed network. Frontiers Media S.A. 2018-07-24 /pmc/articles/PMC6067004/ /pubmed/30087694 http://dx.doi.org/10.3389/fgene.2018.00272 Text en Copyright © 2018 Xu, Liu, Lin, Dong, Liu and He. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xu, Bo
Liu, Yu
Lin, Chi
Dong, Jie
Liu, Xiaoxia
He, Zengyou
Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network
title Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network
title_full Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network
title_fullStr Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network
title_full_unstemmed Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network
title_short Reconstruction of the Protein-Protein Interaction Network for Protein Complexes Identification by Walking on the Protein Pair Fingerprints Similarity Network
title_sort reconstruction of the protein-protein interaction network for protein complexes identification by walking on the protein pair fingerprints similarity network
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6067004/
https://www.ncbi.nlm.nih.gov/pubmed/30087694
http://dx.doi.org/10.3389/fgene.2018.00272
work_keys_str_mv AT xubo reconstructionoftheproteinproteininteractionnetworkforproteincomplexesidentificationbywalkingontheproteinpairfingerprintssimilaritynetwork
AT liuyu reconstructionoftheproteinproteininteractionnetworkforproteincomplexesidentificationbywalkingontheproteinpairfingerprintssimilaritynetwork
AT linchi reconstructionoftheproteinproteininteractionnetworkforproteincomplexesidentificationbywalkingontheproteinpairfingerprintssimilaritynetwork
AT dongjie reconstructionoftheproteinproteininteractionnetworkforproteincomplexesidentificationbywalkingontheproteinpairfingerprintssimilaritynetwork
AT liuxiaoxia reconstructionoftheproteinproteininteractionnetworkforproteincomplexesidentificationbywalkingontheproteinpairfingerprintssimilaritynetwork
AT hezengyou reconstructionoftheproteinproteininteractionnetworkforproteincomplexesidentificationbywalkingontheproteinpairfingerprintssimilaritynetwork