
PFClust: an optimised implementation of a parameter-free clustering algorithm

BACKGROUND: A well-known problem in cluster analysis is finding an optimal number of clusters reflecting the inherent structure of the data. PFClust is a partitioning-based clustering algorithm capable, unlike many widely-used clustering algorithms, of automatically proposing an optimal number of cl...


Bibliographic Details
Main Authors: Musayeva, Khadija; Henderson, Tristan; Mitchell, John BO; Mavridis, Lazaros
Format: Online Article Text
Language: English
Published: BioMed Central 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3940029/
https://www.ncbi.nlm.nih.gov/pubmed/24490618
http://dx.doi.org/10.1186/1751-0473-9-5
_version_ 1782305770033905664
author Musayeva, Khadija
Henderson, Tristan
Mitchell, John BO
Mavridis, Lazaros
author_facet Musayeva, Khadija
Henderson, Tristan
Mitchell, John BO
Mavridis, Lazaros
author_sort Musayeva, Khadija
collection PubMed
description BACKGROUND: A well-known problem in cluster analysis is finding an optimal number of clusters reflecting the inherent structure of the data. PFClust is a partitioning-based clustering algorithm capable, unlike many widely-used clustering algorithms, of automatically proposing an optimal number of clusters for the data. RESULTS: The results of tests on various types of data showed that PFClust can discover clusters of arbitrary shapes, sizes and densities. The previous implementation of the algorithm had already been successfully used to cluster large macromolecular structures and small druglike compounds. We have greatly improved the algorithm by a more efficient implementation, which enables PFClust to process large data sets acceptably fast. CONCLUSIONS: In this paper we present a new optimized implementation of the PFClust algorithm that runs considerably faster than the original.
format Online
Article
Text
id pubmed-3940029
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39400292014-03-04 PFClust: an optimised implementation of a parameter-free clustering algorithm Musayeva, Khadija Henderson, Tristan Mitchell, John BO Mavridis, Lazaros Source Code Biol Med Brief Reports BACKGROUND: A well-known problem in cluster analysis is finding an optimal number of clusters reflecting the inherent structure of the data. PFClust is a partitioning-based clustering algorithm capable, unlike many widely-used clustering algorithms, of automatically proposing an optimal number of clusters for the data. RESULTS: The results of tests on various types of data showed that PFClust can discover clusters of arbitrary shapes, sizes and densities. The previous implementation of the algorithm had already been successfully used to cluster large macromolecular structures and small druglike compounds. We have greatly improved the algorithm by a more efficient implementation, which enables PFClust to process large data sets acceptably fast. CONCLUSIONS: In this paper we present a new optimized implementation of the PFClust algorithm that runs considerably faster than the original. BioMed Central 2014-02-04 /pmc/articles/PMC3940029/ /pubmed/24490618 http://dx.doi.org/10.1186/1751-0473-9-5 Text en Copyright © 2014 Musayeva et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Brief Reports
Musayeva, Khadija
Henderson, Tristan
Mitchell, John BO
Mavridis, Lazaros
PFClust: an optimised implementation of a parameter-free clustering algorithm
title PFClust: an optimised implementation of a parameter-free clustering algorithm
title_full PFClust: an optimised implementation of a parameter-free clustering algorithm
title_fullStr PFClust: an optimised implementation of a parameter-free clustering algorithm
title_full_unstemmed PFClust: an optimised implementation of a parameter-free clustering algorithm
title_short PFClust: an optimised implementation of a parameter-free clustering algorithm
title_sort pfclust: an optimised implementation of a parameter-free clustering algorithm
topic Brief Reports
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3940029/
https://www.ncbi.nlm.nih.gov/pubmed/24490618
http://dx.doi.org/10.1186/1751-0473-9-5