Cargando…

Deep Possibilistic C-means Clustering Algorithm on Medical Datasets

In the past, the possibilistic C-means clustering algorithm (PCM) has proven its superiority on various medical datasets by overcoming the unstable clustering effect caused by both the hard division of traditional hard clustering models and the susceptibility of fuzzy C-means clustering algorithm (F...

Descripción completa

Detalles Bibliográficos
Autores principales: Gu, Yuxin, Ni, Tongguang, Jiang, Yizhang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9034915/
https://www.ncbi.nlm.nih.gov/pubmed/35469221
http://dx.doi.org/10.1155/2022/3469979
Descripción
Sumario:In the past, the possibilistic C-means clustering algorithm (PCM) has proven its superiority on various medical datasets by overcoming the unstable clustering effect caused by both the hard division of traditional hard clustering models and the susceptibility of fuzzy C-means clustering algorithm (FCM) to noise. However, with the deep integration and development of the Internet of Things (IoT) as well as big data with the medical field, the width and height of medical datasets are growing bigger and bigger. In the face of high-dimensional and giant complex datasets, it is challenging for the PCM algorithm based on machine learning to extract valuable features from thousands of dimensions, which increases the computational complexity and useless time consumption and makes it difficult to avoid the quality problem of clustering. To this end, this paper proposes a deep possibilistic C-mean clustering algorithm (DPCM) that combines the traditional PCM algorithm with a special deep network called autoencoder. Taking advantage of the fact that the autoencoder can minimize the reconstruction loss and the PCM uses soft affiliation to facilitate gradient descent, DPCM allows deep neural networks and PCM's clustering centers to be optimized at the same time, so that it effectively improves the clustering efficiency and accuracy. Experiments on medical datasets with various dimensions demonstrate that this method has a better effect than traditional clustering methods, besides being able to overcome the interference of noise better.