Cargando…
Data clustering: application and trends
Clustering has primarily been used as an analytical technique to group unlabeled data for extracting meaningful information. The fact that no clustering algorithm can solve all clustering problems has resulted in the development of several clustering algorithms with diverse applications. We review d...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9702941/ https://www.ncbi.nlm.nih.gov/pubmed/36466764 http://dx.doi.org/10.1007/s10462-022-10325-y |
_version_ | 1784839755912970240 |
---|---|
author | Oyewole, Gbeminiyi John Thopil, George Alex |
author_facet | Oyewole, Gbeminiyi John Thopil, George Alex |
author_sort | Oyewole, Gbeminiyi John |
collection | PubMed |
description | Clustering has primarily been used as an analytical technique to group unlabeled data for extracting meaningful information. The fact that no clustering algorithm can solve all clustering problems has resulted in the development of several clustering algorithms with diverse applications. We review data clustering, intending to underscore recent applications in selected industrial sectors and other notable concepts. In this paper, we begin by highlighting clustering components and discussing classification terminologies. Furthermore, specific, and general applications of clustering are discussed. Notable concepts on clustering algorithms, emerging variants, measures of similarities/dissimilarities, issues surrounding clustering optimization, validation and data types are outlined. Suggestions are made to emphasize the continued interest in clustering techniques both by scholars and Industry practitioners. Key findings in this review show the size of data as a classification criterion and as data sizes for clustering become larger and varied, the determination of the optimal number of clusters will require new feature extracting methods, validation indices and clustering techniques. In addition, clustering techniques have found growing use in key industry sectors linked to the sustainable development goals such as manufacturing, transportation and logistics, energy, and healthcare, where the use of clustering is more integrated with other analytical techniques than a stand-alone clustering technique. |
format | Online Article Text |
id | pubmed-9702941 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-97029412022-11-28 Data clustering: application and trends Oyewole, Gbeminiyi John Thopil, George Alex Artif Intell Rev Article Clustering has primarily been used as an analytical technique to group unlabeled data for extracting meaningful information. The fact that no clustering algorithm can solve all clustering problems has resulted in the development of several clustering algorithms with diverse applications. We review data clustering, intending to underscore recent applications in selected industrial sectors and other notable concepts. In this paper, we begin by highlighting clustering components and discussing classification terminologies. Furthermore, specific, and general applications of clustering are discussed. Notable concepts on clustering algorithms, emerging variants, measures of similarities/dissimilarities, issues surrounding clustering optimization, validation and data types are outlined. Suggestions are made to emphasize the continued interest in clustering techniques both by scholars and Industry practitioners. Key findings in this review show the size of data as a classification criterion and as data sizes for clustering become larger and varied, the determination of the optimal number of clusters will require new feature extracting methods, validation indices and clustering techniques. In addition, clustering techniques have found growing use in key industry sectors linked to the sustainable development goals such as manufacturing, transportation and logistics, energy, and healthcare, where the use of clustering is more integrated with other analytical techniques than a stand-alone clustering technique. Springer Netherlands 2022-11-27 2023 /pmc/articles/PMC9702941/ /pubmed/36466764 http://dx.doi.org/10.1007/s10462-022-10325-y Text en © The Author(s), under exclusive licence to Springer Nature B.V. 2022. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Oyewole, Gbeminiyi John Thopil, George Alex Data clustering: application and trends |
title | Data clustering: application and trends |
title_full | Data clustering: application and trends |
title_fullStr | Data clustering: application and trends |
title_full_unstemmed | Data clustering: application and trends |
title_short | Data clustering: application and trends |
title_sort | data clustering: application and trends |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9702941/ https://www.ncbi.nlm.nih.gov/pubmed/36466764 http://dx.doi.org/10.1007/s10462-022-10325-y |
work_keys_str_mv | AT oyewolegbeminiyijohn dataclusteringapplicationandtrends AT thopilgeorgealex dataclusteringapplicationandtrends |