PFClust: an optimised implementation of a parameter-free clustering algorithm
BACKGROUND: A well-known problem in cluster analysis is finding an optimal number of clusters reflecting the inherent structure of the data. PFClust is a partitioning-based clustering algorithm capable, unlike many widely-used clustering algorithms, of automatically proposing an optimal number of cl...
Main authors: | Musayeva, Khadija; Henderson, Tristan; Mitchell, John BO; Mavridis, Lazaros |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2014 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3940029/ https://www.ncbi.nlm.nih.gov/pubmed/24490618 http://dx.doi.org/10.1186/1751-0473-9-5 |
_version_ | 1782305770033905664 |
---|---|
author | Musayeva, Khadija Henderson, Tristan Mitchell, John BO Mavridis, Lazaros |
author_facet | Musayeva, Khadija Henderson, Tristan Mitchell, John BO Mavridis, Lazaros |
author_sort | Musayeva, Khadija |
collection | PubMed |
description | BACKGROUND: A well-known problem in cluster analysis is finding an optimal number of clusters reflecting the inherent structure of the data. PFClust is a partitioning-based clustering algorithm capable, unlike many widely-used clustering algorithms, of automatically proposing an optimal number of clusters for the data. RESULTS: The results of tests on various types of data showed that PFClust can discover clusters of arbitrary shapes, sizes and densities. The previous implementation of the algorithm had already been successfully used to cluster large macromolecular structures and small druglike compounds. We have greatly improved the algorithm by a more efficient implementation, which enables PFClust to process large data sets acceptably fast. CONCLUSIONS: In this paper we present a new optimized implementation of the PFClust algorithm that runs considerably faster than the original. |
format | Online Article Text |
id | pubmed-3940029 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-39400292014-03-04 PFClust: an optimised implementation of a parameter-free clustering algorithm Musayeva, Khadija Henderson, Tristan Mitchell, John BO Mavridis, Lazaros Source Code Biol Med Brief Reports BACKGROUND: A well-known problem in cluster analysis is finding an optimal number of clusters reflecting the inherent structure of the data. PFClust is a partitioning-based clustering algorithm capable, unlike many widely-used clustering algorithms, of automatically proposing an optimal number of clusters for the data. RESULTS: The results of tests on various types of data showed that PFClust can discover clusters of arbitrary shapes, sizes and densities. The previous implementation of the algorithm had already been successfully used to cluster large macromolecular structures and small druglike compounds. We have greatly improved the algorithm by a more efficient implementation, which enables PFClust to process large data sets acceptably fast. CONCLUSIONS: In this paper we present a new optimized implementation of the PFClust algorithm that runs considerably faster than the original. BioMed Central 2014-02-04 /pmc/articles/PMC3940029/ /pubmed/24490618 http://dx.doi.org/10.1186/1751-0473-9-5 Text en Copyright © 2014 Musayeva et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Brief Reports Musayeva, Khadija Henderson, Tristan Mitchell, John BO Mavridis, Lazaros PFClust: an optimised implementation of a parameter-free clustering algorithm |
title | PFClust: an optimised implementation of a parameter-free clustering algorithm |
title_full | PFClust: an optimised implementation of a parameter-free clustering algorithm |
title_fullStr | PFClust: an optimised implementation of a parameter-free clustering algorithm |
title_full_unstemmed | PFClust: an optimised implementation of a parameter-free clustering algorithm |
title_short | PFClust: an optimised implementation of a parameter-free clustering algorithm |
title_sort | pfclust: an optimised implementation of a parameter-free clustering algorithm |
topic | Brief Reports |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3940029/ https://www.ncbi.nlm.nih.gov/pubmed/24490618 http://dx.doi.org/10.1186/1751-0473-9-5 |
work_keys_str_mv | AT musayevakhadija pfclustanoptimisedimplementationofaparameterfreeclusteringalgorithm AT hendersontristan pfclustanoptimisedimplementationofaparameterfreeclusteringalgorithm AT mitchelljohnbo pfclustanoptimisedimplementationofaparameterfreeclusteringalgorithm AT mavridislazaros pfclustanoptimisedimplementationofaparameterfreeclusteringalgorithm |