Cargando…

Identifying protein complexes with fuzzy machine learning model

BACKGROUND: Many computational approaches have been developed to detect protein complexes from protein-protein interaction (PPI) networks. However, these PPI networks are always built from high-throughput experiments. The presence of unreliable interactions in PPI network makes this task very challe...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Bo, Lin, Hongfei, Wagholikar, Kavishwar B, Yang, Zhihao, Liu, Hongfang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3908516/
https://www.ncbi.nlm.nih.gov/pubmed/24565338
http://dx.doi.org/10.1186/1477-5956-11-S1-S21
_version_ 1782301717138767872
author Xu, Bo
Lin, Hongfei
Wagholikar, Kavishwar B
Yang, Zhihao
Liu, Hongfang
author_facet Xu, Bo
Lin, Hongfei
Wagholikar, Kavishwar B
Yang, Zhihao
Liu, Hongfang
author_sort Xu, Bo
collection PubMed
description BACKGROUND: Many computational approaches have been developed to detect protein complexes from protein-protein interaction (PPI) networks. However, these PPI networks are always built from high-throughput experiments. The presence of unreliable interactions in PPI network makes this task very challenging. METHODS: In this study, we proposed a Genetic-Algorithm Fuzzy Naïve Bayes (GAFNB) filter to classify the protein complexes from candidate subgraphs. It takes unreliability into consideration and tackles the presence of unreliable interactions in protein complex. We first got candidate protein complexes through existed popular methods. Each candidate protein complex is represented by 29 graph features and 266 biological property based features. GAFNB model is then applied to classify the candidate complexes into positive or negative. RESULTS: Our evaluation indicates that the protein complex identification algorithms using the GAFNB model filtering outperform original ones. For evaluation of GAFNB model, we also compared the performance of GAFNB with Naïve Bayes (NB). Results show that GAFNB performed better than NB. It indicates that a fuzzy model is more suitable when unreliability is present. CONCLUSIONS: We conclude that filtering candidate protein complexes with GAFNB model can improve the effectiveness of protein complex identification. It is necessary to consider the unreliability in this task.
format Online
Article
Text
id pubmed-3908516
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39085162014-02-13 Identifying protein complexes with fuzzy machine learning model Xu, Bo Lin, Hongfei Wagholikar, Kavishwar B Yang, Zhihao Liu, Hongfang Proteome Sci Research BACKGROUND: Many computational approaches have been developed to detect protein complexes from protein-protein interaction (PPI) networks. However, these PPI networks are always built from high-throughput experiments. The presence of unreliable interactions in PPI network makes this task very challenging. METHODS: In this study, we proposed a Genetic-Algorithm Fuzzy Naïve Bayes (GAFNB) filter to classify the protein complexes from candidate subgraphs. It takes unreliability into consideration and tackles the presence of unreliable interactions in protein complex. We first got candidate protein complexes through existed popular methods. Each candidate protein complex is represented by 29 graph features and 266 biological property based features. GAFNB model is then applied to classify the candidate complexes into positive or negative. RESULTS: Our evaluation indicates that the protein complex identification algorithms using the GAFNB model filtering outperform original ones. For evaluation of GAFNB model, we also compared the performance of GAFNB with Naïve Bayes (NB). Results show that GAFNB performed better than NB. It indicates that a fuzzy model is more suitable when unreliability is present. CONCLUSIONS: We conclude that filtering candidate protein complexes with GAFNB model can improve the effectiveness of protein complex identification. It is necessary to consider the unreliability in this task. BioMed Central 2013-11-07 /pmc/articles/PMC3908516/ /pubmed/24565338 http://dx.doi.org/10.1186/1477-5956-11-S1-S21 Text en Copyright © 2013 Xu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Xu, Bo
Lin, Hongfei
Wagholikar, Kavishwar B
Yang, Zhihao
Liu, Hongfang
Identifying protein complexes with fuzzy machine learning model
title Identifying protein complexes with fuzzy machine learning model
title_full Identifying protein complexes with fuzzy machine learning model
title_fullStr Identifying protein complexes with fuzzy machine learning model
title_full_unstemmed Identifying protein complexes with fuzzy machine learning model
title_short Identifying protein complexes with fuzzy machine learning model
title_sort identifying protein complexes with fuzzy machine learning model
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3908516/
https://www.ncbi.nlm.nih.gov/pubmed/24565338
http://dx.doi.org/10.1186/1477-5956-11-S1-S21
work_keys_str_mv AT xubo identifyingproteincomplexeswithfuzzymachinelearningmodel
AT linhongfei identifyingproteincomplexeswithfuzzymachinelearningmodel
AT wagholikarkavishwarb identifyingproteincomplexeswithfuzzymachinelearningmodel
AT yangzhihao identifyingproteincomplexeswithfuzzymachinelearningmodel
AT liuhongfang identifyingproteincomplexeswithfuzzymachinelearningmodel