Cargando…
Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method
Data Streams create new challenges for fuzzy clustering algorithms, specifically Interval Type-2 Fuzzy C-Means (IT2FCM). One problem associated with IT2FCM is that it tends to be sensitive to initialization conditions and therefore, fails to return global optima. This problem has been addressed by o...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7309007/ https://www.ncbi.nlm.nih.gov/pubmed/32517018 http://dx.doi.org/10.3390/s20113210 |
_version_ | 1783549123983572992 |
---|---|
author | Qaiyum, Sana Aziz, Izzatdin Hasan, Mohd Hilmi Khan, Asif Irshad Almalawi, Abdulmohsen |
author_facet | Qaiyum, Sana Aziz, Izzatdin Hasan, Mohd Hilmi Khan, Asif Irshad Almalawi, Abdulmohsen |
author_sort | Qaiyum, Sana |
collection | PubMed |
description | Data Streams create new challenges for fuzzy clustering algorithms, specifically Interval Type-2 Fuzzy C-Means (IT2FCM). One problem associated with IT2FCM is that it tends to be sensitive to initialization conditions and therefore, fails to return global optima. This problem has been addressed by optimizing IT2FCM using Ant Colony Optimization approach. However, IT2FCM-ACO obtain clusters for the whole dataset which is not suitable for clustering large streaming datasets that may be coming continuously and evolves with time. Thus, the clusters generated will also evolve with time. Additionally, the incoming data may not be available in memory all at once because of its size. Therefore, to encounter the challenges of a large data stream environment we propose improvising IT2FCM-ACO to generate clusters incrementally. The proposed algorithm produces clusters by determining appropriate cluster centers on a certain percentage of available datasets and then the obtained cluster centroids are combined with new incoming data points to generate another set of cluster centers. The process continues until all the data are scanned. The previous data points are released from memory which reduces time and space complexity. Thus, the proposed incremental method produces data partitions comparable to IT2FCM-ACO. The performance of the proposed method is evaluated on large real-life datasets. The results obtained from several fuzzy cluster validity index measures show the enhanced performance of the proposed method over other clustering algorithms. The proposed algorithm also improves upon the run time and produces excellent speed-ups for all datasets. |
format | Online Article Text |
id | pubmed-7309007 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-73090072020-06-25 Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method Qaiyum, Sana Aziz, Izzatdin Hasan, Mohd Hilmi Khan, Asif Irshad Almalawi, Abdulmohsen Sensors (Basel) Article Data Streams create new challenges for fuzzy clustering algorithms, specifically Interval Type-2 Fuzzy C-Means (IT2FCM). One problem associated with IT2FCM is that it tends to be sensitive to initialization conditions and therefore, fails to return global optima. This problem has been addressed by optimizing IT2FCM using Ant Colony Optimization approach. However, IT2FCM-ACO obtain clusters for the whole dataset which is not suitable for clustering large streaming datasets that may be coming continuously and evolves with time. Thus, the clusters generated will also evolve with time. Additionally, the incoming data may not be available in memory all at once because of its size. Therefore, to encounter the challenges of a large data stream environment we propose improvising IT2FCM-ACO to generate clusters incrementally. The proposed algorithm produces clusters by determining appropriate cluster centers on a certain percentage of available datasets and then the obtained cluster centroids are combined with new incoming data points to generate another set of cluster centers. The process continues until all the data are scanned. The previous data points are released from memory which reduces time and space complexity. Thus, the proposed incremental method produces data partitions comparable to IT2FCM-ACO. The performance of the proposed method is evaluated on large real-life datasets. The results obtained from several fuzzy cluster validity index measures show the enhanced performance of the proposed method over other clustering algorithms. The proposed algorithm also improves upon the run time and produces excellent speed-ups for all datasets. MDPI 2020-06-05 /pmc/articles/PMC7309007/ /pubmed/32517018 http://dx.doi.org/10.3390/s20113210 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Qaiyum, Sana Aziz, Izzatdin Hasan, Mohd Hilmi Khan, Asif Irshad Almalawi, Abdulmohsen Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method |
title | Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method |
title_full | Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method |
title_fullStr | Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method |
title_full_unstemmed | Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method |
title_short | Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method |
title_sort | incremental interval type-2 fuzzy clustering of data streams using single pass method |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7309007/ https://www.ncbi.nlm.nih.gov/pubmed/32517018 http://dx.doi.org/10.3390/s20113210 |
work_keys_str_mv | AT qaiyumsana incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod AT azizizzatdin incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod AT hasanmohdhilmi incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod AT khanasifirshad incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod AT almalawiabdulmohsen incrementalintervaltype2fuzzyclusteringofdatastreamsusingsinglepassmethod |