Cargando…

Clustering PPI data by combining FA and SHC method

Clustering is one of main methods to identify functional modules from protein-protein interaction (PPI) data. Nevertheless traditional clustering methods may not be effective for clustering PPI data. In this paper, we proposed a novel method for clustering PPI data by combining firefly algorithm (FA...

Descripción completa

Detalles Bibliográficos
Autores principales: Lei, Xiujuan, Ying, Chao, Wu, Fang-Xiang, Xu, Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4331806/
https://www.ncbi.nlm.nih.gov/pubmed/25707632
http://dx.doi.org/10.1186/1471-2164-16-S3-S3
_version_ 1782357783351394304
author Lei, Xiujuan
Ying, Chao
Wu, Fang-Xiang
Xu, Jin
author_facet Lei, Xiujuan
Ying, Chao
Wu, Fang-Xiang
Xu, Jin
author_sort Lei, Xiujuan
collection PubMed
description Clustering is one of main methods to identify functional modules from protein-protein interaction (PPI) data. Nevertheless traditional clustering methods may not be effective for clustering PPI data. In this paper, we proposed a novel method for clustering PPI data by combining firefly algorithm (FA) and synchronization-based hierarchical clustering (SHC) algorithm. Firstly, the PPI data are preprocessed via spectral clustering (SC) which transforms the high-dimensional similarity matrix into a low dimension matrix. Then the SHC algorithm is used to perform clustering. In SHC algorithm, hierarchical clustering is achieved by enlarging the neighborhood radius of synchronized objects continuously, while the hierarchical search is very difficult to find the optimal neighborhood radius of synchronization and the efficiency is not high. So we adopt the firefly algorithm to determine the optimal threshold of the neighborhood radius of synchronization automatically. The proposed algorithm is tested on the MIPS PPI dataset. The results show that our proposed algorithm is better than the traditional algorithms in precision, recall and f-measure value.
format Online
Article
Text
id pubmed-4331806
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43318062015-03-19 Clustering PPI data by combining FA and SHC method Lei, Xiujuan Ying, Chao Wu, Fang-Xiang Xu, Jin BMC Genomics Proceedings Clustering is one of main methods to identify functional modules from protein-protein interaction (PPI) data. Nevertheless traditional clustering methods may not be effective for clustering PPI data. In this paper, we proposed a novel method for clustering PPI data by combining firefly algorithm (FA) and synchronization-based hierarchical clustering (SHC) algorithm. Firstly, the PPI data are preprocessed via spectral clustering (SC) which transforms the high-dimensional similarity matrix into a low dimension matrix. Then the SHC algorithm is used to perform clustering. In SHC algorithm, hierarchical clustering is achieved by enlarging the neighborhood radius of synchronized objects continuously, while the hierarchical search is very difficult to find the optimal neighborhood radius of synchronization and the efficiency is not high. So we adopt the firefly algorithm to determine the optimal threshold of the neighborhood radius of synchronization automatically. The proposed algorithm is tested on the MIPS PPI dataset. The results show that our proposed algorithm is better than the traditional algorithms in precision, recall and f-measure value. BioMed Central 2015-01-29 /pmc/articles/PMC4331806/ /pubmed/25707632 http://dx.doi.org/10.1186/1471-2164-16-S3-S3 Text en Copyright © 2015 Lei et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Lei, Xiujuan
Ying, Chao
Wu, Fang-Xiang
Xu, Jin
Clustering PPI data by combining FA and SHC method
title Clustering PPI data by combining FA and SHC method
title_full Clustering PPI data by combining FA and SHC method
title_fullStr Clustering PPI data by combining FA and SHC method
title_full_unstemmed Clustering PPI data by combining FA and SHC method
title_short Clustering PPI data by combining FA and SHC method
title_sort clustering ppi data by combining fa and shc method
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4331806/
https://www.ncbi.nlm.nih.gov/pubmed/25707632
http://dx.doi.org/10.1186/1471-2164-16-S3-S3
work_keys_str_mv AT leixiujuan clusteringppidatabycombiningfaandshcmethod
AT yingchao clusteringppidatabycombiningfaandshcmethod
AT wufangxiang clusteringppidatabycombiningfaandshcmethod
AT xujin clusteringppidatabycombiningfaandshcmethod