Cargando…

Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method

Data Streams create new challenges for fuzzy clustering algorithms, specifically Interval Type-2 Fuzzy C-Means (IT2FCM). One problem associated with IT2FCM is that it tends to be sensitive to initialization conditions and therefore, fails to return global optima. This problem has been addressed by o...

Descripción completa

Detalles Bibliográficos
Autores principales: Qaiyum, Sana, Aziz, Izzatdin, Hasan, Mohd Hilmi, Khan, Asif Irshad, Almalawi, Abdulmohsen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7309007/
https://www.ncbi.nlm.nih.gov/pubmed/32517018
http://dx.doi.org/10.3390/s20113210
_version_ 1783549123983572992
author Qaiyum, Sana
Aziz, Izzatdin
Hasan, Mohd Hilmi
Khan, Asif Irshad
Almalawi, Abdulmohsen
author_facet Qaiyum, Sana
Aziz, Izzatdin
Hasan, Mohd Hilmi
Khan, Asif Irshad
Almalawi, Abdulmohsen
author_sort Qaiyum, Sana
collection PubMed
description Data Streams create new challenges for fuzzy clustering algorithms, specifically Interval Type-2 Fuzzy C-Means (IT2FCM). One problem associated with IT2FCM is that it tends to be sensitive to initialization conditions and therefore, fails to return global optima. This problem has been addressed by optimizing IT2FCM using Ant Colony Optimization approach. However, IT2FCM-ACO obtain clusters for the whole dataset which is not suitable for clustering large streaming datasets that may be coming continuously and evolves with time. Thus, the clusters generated will also evolve with time. Additionally, the incoming data may not be available in memory all at once because of its size. Therefore, to encounter the challenges of a large data stream environment we propose improvising IT2FCM-ACO to generate clusters incrementally. The proposed algorithm produces clusters by determining appropriate cluster centers on a certain percentage of available datasets and then the obtained cluster centroids are combined with new incoming data points to generate another set of cluster centers. The process continues until all the data are scanned. The previous data points are released from memory which reduces time and space complexity. Thus, the proposed incremental method produces data partitions comparable to IT2FCM-ACO. The performance of the proposed method is evaluated on large real-life datasets. The results obtained from several fuzzy cluster validity index measures show the enhanced performance of the proposed method over other clustering algorithms. The proposed algorithm also improves upon the run time and produces excellent speed-ups for all datasets.
format Online
Article
Text
id pubmed-7309007
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73090072020-06-25 Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method Qaiyum, Sana Aziz, Izzatdin Hasan, Mohd Hilmi Khan, Asif Irshad Almalawi, Abdulmohsen Sensors (Basel) Article Data Streams create new challenges for fuzzy clustering algorithms, specifically Interval Type-2 Fuzzy C-Means (IT2FCM). One problem associated with IT2FCM is that it tends to be sensitive to initialization conditions and therefore, fails to return global optima. This problem has been addressed by optimizing IT2FCM using Ant Colony Optimization approach. However, IT2FCM-ACO obtain clusters for the whole dataset which is not suitable for clustering large streaming datasets that may be coming continuously and evolves with time. Thus, the clusters generated will also evolve with time. Additionally, the incoming data may not be available in memory all at once because of its size. Therefore, to encounter the challenges of a large data stream environment we propose improvising IT2FCM-ACO to generate clusters incrementally. The proposed algorithm produces clusters by determining appropriate cluster centers on a certain percentage of available datasets and then the obtained cluster centroids are combined with new incoming data points to generate another set of cluster centers. The process continues until all the data are scanned. The previous data points are released from memory which reduces time and space complexity. Thus, the proposed incremental method produces data partitions comparable to IT2FCM-ACO. The performance of the proposed method is evaluated on large real-life datasets. The results obtained from several fuzzy cluster validity index measures show the enhanced performance of the proposed method over other clustering algorithms. The proposed algorithm also improves upon the run time and produces excellent speed-ups for all datasets. MDPI 2020-06-05 /pmc/articles/PMC7309007/ /pubmed/32517018 http://dx.doi.org/10.3390/s20113210 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Qaiyum, Sana
Aziz, Izzatdin
Hasan, Mohd Hilmi
Khan, Asif Irshad
Almalawi, Abdulmohsen
Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method
title Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method
title_full Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method
title_fullStr Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method
title_full_unstemmed Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method
title_short Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method
title_sort incremental interval type-2 fuzzy clustering of data streams using single pass method
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7309007/
https://www.ncbi.nlm.nih.gov/pubmed/32517018
http://dx.doi.org/10.3390/s20113210
work_keys_str_mv AT qaiyumsana incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod
AT azizizzatdin incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod
AT hasanmohdhilmi incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod
AT khanasifirshad incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod
AT almalawiabdulmohsen incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod