Cargando…

A novel harmony search-K means hybrid algorithm for clustering gene expression data

Recent progress in bioinformatics research has led to the accumulation of huge quantities of biological data at various data sources. The DNA microarray technology makes it possible to simultaneously analyze large number of genes across different samples. Clustering of microarray data can reveal the...

Descripción completa

Detalles Bibliográficos
Autores principales: Nazeer, KA Abdul, Sebastian, MP, Kumar, SD Madhu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3563403/
https://www.ncbi.nlm.nih.gov/pubmed/23390351
http://dx.doi.org/10.6026/97320630009084
_version_ 1782258178708209664
author Nazeer, KA Abdul
Sebastian, MP
Kumar, SD Madhu
author_facet Nazeer, KA Abdul
Sebastian, MP
Kumar, SD Madhu
author_sort Nazeer, KA Abdul
collection PubMed
description Recent progress in bioinformatics research has led to the accumulation of huge quantities of biological data at various data sources. The DNA microarray technology makes it possible to simultaneously analyze large number of genes across different samples. Clustering of microarray data can reveal the hidden gene expression patterns from large quantities of expression data that in turn offers tremendous possibilities in functional genomics, comparative genomics, disease diagnosis and drug development. The k- ¬means clustering algorithm is widely used for many practical applications. But the original k-¬means algorithm has several drawbacks. It is computationally expensive and generates locally optimal solutions based on the random choice of the initial centroids. Several methods have been proposed in the literature for improving the performance of the k-¬means algorithm. A meta-heuristic optimization algorithm named harmony search helps find out near-global optimal solutions by searching the entire solution space. Low clustering accuracy of the existing algorithms limits their use in many crucial applications of life sciences. In this paper we propose a novel Harmony Search-K means Hybrid (HSKH) algorithm for clustering the gene expression data. Experimental results show that the proposed algorithm produces clusters with better accuracy in comparison with the existing algorithms.
format Online
Article
Text
id pubmed-3563403
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-35634032013-02-06 A novel harmony search-K means hybrid algorithm for clustering gene expression data Nazeer, KA Abdul Sebastian, MP Kumar, SD Madhu Bioinformation Hypothesis Recent progress in bioinformatics research has led to the accumulation of huge quantities of biological data at various data sources. The DNA microarray technology makes it possible to simultaneously analyze large number of genes across different samples. Clustering of microarray data can reveal the hidden gene expression patterns from large quantities of expression data that in turn offers tremendous possibilities in functional genomics, comparative genomics, disease diagnosis and drug development. The k- ¬means clustering algorithm is widely used for many practical applications. But the original k-¬means algorithm has several drawbacks. It is computationally expensive and generates locally optimal solutions based on the random choice of the initial centroids. Several methods have been proposed in the literature for improving the performance of the k-¬means algorithm. A meta-heuristic optimization algorithm named harmony search helps find out near-global optimal solutions by searching the entire solution space. Low clustering accuracy of the existing algorithms limits their use in many crucial applications of life sciences. In this paper we propose a novel Harmony Search-K means Hybrid (HSKH) algorithm for clustering the gene expression data. Experimental results show that the proposed algorithm produces clusters with better accuracy in comparison with the existing algorithms. Biomedical Informatics 2013-01-18 /pmc/articles/PMC3563403/ /pubmed/23390351 http://dx.doi.org/10.6026/97320630009084 Text en © 2013 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Hypothesis
Nazeer, KA Abdul
Sebastian, MP
Kumar, SD Madhu
A novel harmony search-K means hybrid algorithm for clustering gene expression data
title A novel harmony search-K means hybrid algorithm for clustering gene expression data
title_full A novel harmony search-K means hybrid algorithm for clustering gene expression data
title_fullStr A novel harmony search-K means hybrid algorithm for clustering gene expression data
title_full_unstemmed A novel harmony search-K means hybrid algorithm for clustering gene expression data
title_short A novel harmony search-K means hybrid algorithm for clustering gene expression data
title_sort novel harmony search-k means hybrid algorithm for clustering gene expression data
topic Hypothesis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3563403/
https://www.ncbi.nlm.nih.gov/pubmed/23390351
http://dx.doi.org/10.6026/97320630009084
work_keys_str_mv AT nazeerkaabdul anovelharmonysearchkmeanshybridalgorithmforclusteringgeneexpressiondata
AT sebastianmp anovelharmonysearchkmeanshybridalgorithmforclusteringgeneexpressiondata
AT kumarsdmadhu anovelharmonysearchkmeanshybridalgorithmforclusteringgeneexpressiondata
AT nazeerkaabdul novelharmonysearchkmeanshybridalgorithmforclusteringgeneexpressiondata
AT sebastianmp novelharmonysearchkmeanshybridalgorithmforclusteringgeneexpressiondata
AT kumarsdmadhu novelharmonysearchkmeanshybridalgorithmforclusteringgeneexpressiondata