Cargando…

SMART: Unique Splitting-While-Merging Framework for Gene Clustering

Successful clustering algorithms are highly dependent on parameter settings. The clustering performance degrades significantly unless parameters are properly set, and yet, it is difficult to set these parameters a priori. To address this issue, in this paper, we propose a unique splitting-while-merg...

Descripción completa

Detalles Bibliográficos
Autores principales: Fa, Rui, Roberts, David J., Nandi, Asoke K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3979766/
https://www.ncbi.nlm.nih.gov/pubmed/24714159
http://dx.doi.org/10.1371/journal.pone.0094141
_version_ 1782310760648540160
author Fa, Rui
Roberts, David J.
Nandi, Asoke K.
author_facet Fa, Rui
Roberts, David J.
Nandi, Asoke K.
author_sort Fa, Rui
collection PubMed
description Successful clustering algorithms are highly dependent on parameter settings. The clustering performance degrades significantly unless parameters are properly set, and yet, it is difficult to set these parameters a priori. To address this issue, in this paper, we propose a unique splitting-while-merging clustering framework, named “splitting merging awareness tactics” (SMART), which does not require any a priori knowledge of either the number of clusters or even the possible range of this number. Unlike existing self-splitting algorithms, which over-cluster the dataset to a large number of clusters and then merge some similar clusters, our framework has the ability to split and merge clusters automatically during the process and produces the the most reliable clustering results, by intrinsically integrating many clustering techniques and tasks. The SMART framework is implemented with two distinct clustering paradigms in two algorithms: competitive learning and finite mixture model. Nevertheless, within the proposed SMART framework, many other algorithms can be derived for different clustering paradigms. The minimum message length algorithm is integrated into the framework as the clustering selection criterion. The usefulness of the SMART framework and its algorithms is tested in demonstration datasets and simulated gene expression datasets. Moreover, two real microarray gene expression datasets are studied using this approach. Based on the performance of many metrics, all numerical results show that SMART is superior to compared existing self-splitting algorithms and traditional algorithms. Three main properties of the proposed SMART framework are summarized as: (1) needing no parameters dependent on the respective dataset or a priori knowledge about the datasets, (2) extendible to many different applications, (3) offering superior performance compared with counterpart algorithms.
format Online
Article
Text
id pubmed-3979766
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39797662014-04-11 SMART: Unique Splitting-While-Merging Framework for Gene Clustering Fa, Rui Roberts, David J. Nandi, Asoke K. PLoS One Research Article Successful clustering algorithms are highly dependent on parameter settings. The clustering performance degrades significantly unless parameters are properly set, and yet, it is difficult to set these parameters a priori. To address this issue, in this paper, we propose a unique splitting-while-merging clustering framework, named “splitting merging awareness tactics” (SMART), which does not require any a priori knowledge of either the number of clusters or even the possible range of this number. Unlike existing self-splitting algorithms, which over-cluster the dataset to a large number of clusters and then merge some similar clusters, our framework has the ability to split and merge clusters automatically during the process and produces the the most reliable clustering results, by intrinsically integrating many clustering techniques and tasks. The SMART framework is implemented with two distinct clustering paradigms in two algorithms: competitive learning and finite mixture model. Nevertheless, within the proposed SMART framework, many other algorithms can be derived for different clustering paradigms. The minimum message length algorithm is integrated into the framework as the clustering selection criterion. The usefulness of the SMART framework and its algorithms is tested in demonstration datasets and simulated gene expression datasets. Moreover, two real microarray gene expression datasets are studied using this approach. Based on the performance of many metrics, all numerical results show that SMART is superior to compared existing self-splitting algorithms and traditional algorithms. Three main properties of the proposed SMART framework are summarized as: (1) needing no parameters dependent on the respective dataset or a priori knowledge about the datasets, (2) extendible to many different applications, (3) offering superior performance compared with counterpart algorithms. Public Library of Science 2014-04-08 /pmc/articles/PMC3979766/ /pubmed/24714159 http://dx.doi.org/10.1371/journal.pone.0094141 Text en © 2014 Fa et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Fa, Rui
Roberts, David J.
Nandi, Asoke K.
SMART: Unique Splitting-While-Merging Framework for Gene Clustering
title SMART: Unique Splitting-While-Merging Framework for Gene Clustering
title_full SMART: Unique Splitting-While-Merging Framework for Gene Clustering
title_fullStr SMART: Unique Splitting-While-Merging Framework for Gene Clustering
title_full_unstemmed SMART: Unique Splitting-While-Merging Framework for Gene Clustering
title_short SMART: Unique Splitting-While-Merging Framework for Gene Clustering
title_sort smart: unique splitting-while-merging framework for gene clustering
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3979766/
https://www.ncbi.nlm.nih.gov/pubmed/24714159
http://dx.doi.org/10.1371/journal.pone.0094141
work_keys_str_mv AT farui smartuniquesplittingwhilemergingframeworkforgeneclustering
AT robertsdavidj smartuniquesplittingwhilemergingframeworkforgeneclustering
AT nandiasokek smartuniquesplittingwhilemergingframeworkforgeneclustering