Cargando…

Identifying protein complexes based on an edge weight algorithm and core-attachment structure

BACKGROUND: Protein complex identification from protein-protein interaction (PPI) networks is crucial for understanding cellular organization principles and functional mechanisms. In recent decades, numerous computational methods have been proposed to identify protein complexes. However, most of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Rongquan, Liu, Guixia, Wang, Caixia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6744658/
https://www.ncbi.nlm.nih.gov/pubmed/31521132
http://dx.doi.org/10.1186/s12859-019-3007-y
_version_ 1783451416977735680
author Wang, Rongquan
Liu, Guixia
Wang, Caixia
author_facet Wang, Rongquan
Liu, Guixia
Wang, Caixia
author_sort Wang, Rongquan
collection PubMed
description BACKGROUND: Protein complex identification from protein-protein interaction (PPI) networks is crucial for understanding cellular organization principles and functional mechanisms. In recent decades, numerous computational methods have been proposed to identify protein complexes. However, most of the current state-of-the-art studies still have some challenges to resolve, including their high false-positives rates, incapability of identifying overlapping complexes, lack of consideration for the inherent organization within protein complexes, and absence of some biological attachment proteins. RESULTS: In this paper, to overcome these limitations, we present a protein complex identification method based on an edge weight method and core-attachment structure (EWCA) which consists of a complex core and some sparse attachment proteins. First, we propose a new weighting method to assess the reliability of interactions. Second, we identify protein complex cores by using the structural similarity between a seed and its direct neighbors. Third, we introduce a new method to detect attachment proteins that is able to distinguish and identify peripheral proteins and overlapping proteins. Finally, we bind attachment proteins to their corresponding complex cores to form protein complexes and discard redundant protein complexes. The experimental results indicate that EWCA outperforms existing state-of-the-art methods in terms of both accuracy and p-value. Furthermore, EWCA could identify many more protein complexes with statistical significance. Additionally, EWCA could have better balance accuracy and efficiency than some state-of-the-art methods with high accuracy. CONCLUSIONS: In summary, EWCA has better performance for protein complex identification by a comprehensive comparison with twelve algorithms in terms of different evaluation metrics. The datasets and software are freely available for academic research at https://github.com/RongquanWang/EWCA.
format Online
Article
Text
id pubmed-6744658
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-67446582019-09-18 Identifying protein complexes based on an edge weight algorithm and core-attachment structure Wang, Rongquan Liu, Guixia Wang, Caixia BMC Bioinformatics Methodology Article BACKGROUND: Protein complex identification from protein-protein interaction (PPI) networks is crucial for understanding cellular organization principles and functional mechanisms. In recent decades, numerous computational methods have been proposed to identify protein complexes. However, most of the current state-of-the-art studies still have some challenges to resolve, including their high false-positives rates, incapability of identifying overlapping complexes, lack of consideration for the inherent organization within protein complexes, and absence of some biological attachment proteins. RESULTS: In this paper, to overcome these limitations, we present a protein complex identification method based on an edge weight method and core-attachment structure (EWCA) which consists of a complex core and some sparse attachment proteins. First, we propose a new weighting method to assess the reliability of interactions. Second, we identify protein complex cores by using the structural similarity between a seed and its direct neighbors. Third, we introduce a new method to detect attachment proteins that is able to distinguish and identify peripheral proteins and overlapping proteins. Finally, we bind attachment proteins to their corresponding complex cores to form protein complexes and discard redundant protein complexes. The experimental results indicate that EWCA outperforms existing state-of-the-art methods in terms of both accuracy and p-value. Furthermore, EWCA could identify many more protein complexes with statistical significance. Additionally, EWCA could have better balance accuracy and efficiency than some state-of-the-art methods with high accuracy. CONCLUSIONS: In summary, EWCA has better performance for protein complex identification by a comprehensive comparison with twelve algorithms in terms of different evaluation metrics. The datasets and software are freely available for academic research at https://github.com/RongquanWang/EWCA. BioMed Central 2019-09-14 /pmc/articles/PMC6744658/ /pubmed/31521132 http://dx.doi.org/10.1186/s12859-019-3007-y Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Wang, Rongquan
Liu, Guixia
Wang, Caixia
Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_full Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_fullStr Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_full_unstemmed Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_short Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_sort identifying protein complexes based on an edge weight algorithm and core-attachment structure
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6744658/
https://www.ncbi.nlm.nih.gov/pubmed/31521132
http://dx.doi.org/10.1186/s12859-019-3007-y
work_keys_str_mv AT wangrongquan identifyingproteincomplexesbasedonanedgeweightalgorithmandcoreattachmentstructure
AT liuguixia identifyingproteincomplexesbasedonanedgeweightalgorithmandcoreattachmentstructure
AT wangcaixia identifyingproteincomplexesbasedonanedgeweightalgorithmandcoreattachmentstructure