Cargando…

Minimum Distribution Support Vector Clustering

Support vector clustering (SVC) is a boundary-based algorithm, which has several advantages over other clustering methods, including identifying clusters of arbitrary shapes and numbers. Leveraged by the high generalization ability of the large margin distribution machine (LDM) and the optimal margi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yan, Chen, Jiali, Xie, Xuping, Yang, Sen, Pang, Wei, Huang, Lan, Zhang, Shuangquan, Zhao, Shishun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623761/
https://www.ncbi.nlm.nih.gov/pubmed/34828171
http://dx.doi.org/10.3390/e23111473
_version_ 1784606009629605888
author Wang, Yan
Chen, Jiali
Xie, Xuping
Yang, Sen
Pang, Wei
Huang, Lan
Zhang, Shuangquan
Zhao, Shishun
author_facet Wang, Yan
Chen, Jiali
Xie, Xuping
Yang, Sen
Pang, Wei
Huang, Lan
Zhang, Shuangquan
Zhao, Shishun
author_sort Wang, Yan
collection PubMed
description Support vector clustering (SVC) is a boundary-based algorithm, which has several advantages over other clustering methods, including identifying clusters of arbitrary shapes and numbers. Leveraged by the high generalization ability of the large margin distribution machine (LDM) and the optimal margin distribution clustering (ODMC), we propose a new clustering method: minimum distribution for support vector clustering (MDSVC), for improving the robustness of boundary point recognition, which characterizes the optimal hypersphere by the first-order and second-order statistics and tries to minimize the mean and variance simultaneously. In addition, we further prove, theoretically, that our algorithm can obtain better generalization performance. Some instructive insights for adjusting the number of support vector points are gained. For the optimization problem of MDSVC, we propose a double coordinate descent algorithm for small and medium samples. The experimental results on both artificial and real datasets indicate that our MDSVC has a significant improvement in generalization performance compared to SVC.
format Online
Article
Text
id pubmed-8623761
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86237612021-11-27 Minimum Distribution Support Vector Clustering Wang, Yan Chen, Jiali Xie, Xuping Yang, Sen Pang, Wei Huang, Lan Zhang, Shuangquan Zhao, Shishun Entropy (Basel) Article Support vector clustering (SVC) is a boundary-based algorithm, which has several advantages over other clustering methods, including identifying clusters of arbitrary shapes and numbers. Leveraged by the high generalization ability of the large margin distribution machine (LDM) and the optimal margin distribution clustering (ODMC), we propose a new clustering method: minimum distribution for support vector clustering (MDSVC), for improving the robustness of boundary point recognition, which characterizes the optimal hypersphere by the first-order and second-order statistics and tries to minimize the mean and variance simultaneously. In addition, we further prove, theoretically, that our algorithm can obtain better generalization performance. Some instructive insights for adjusting the number of support vector points are gained. For the optimization problem of MDSVC, we propose a double coordinate descent algorithm for small and medium samples. The experimental results on both artificial and real datasets indicate that our MDSVC has a significant improvement in generalization performance compared to SVC. MDPI 2021-11-08 /pmc/articles/PMC8623761/ /pubmed/34828171 http://dx.doi.org/10.3390/e23111473 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Yan
Chen, Jiali
Xie, Xuping
Yang, Sen
Pang, Wei
Huang, Lan
Zhang, Shuangquan
Zhao, Shishun
Minimum Distribution Support Vector Clustering
title Minimum Distribution Support Vector Clustering
title_full Minimum Distribution Support Vector Clustering
title_fullStr Minimum Distribution Support Vector Clustering
title_full_unstemmed Minimum Distribution Support Vector Clustering
title_short Minimum Distribution Support Vector Clustering
title_sort minimum distribution support vector clustering
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623761/
https://www.ncbi.nlm.nih.gov/pubmed/34828171
http://dx.doi.org/10.3390/e23111473
work_keys_str_mv AT wangyan minimumdistributionsupportvectorclustering
AT chenjiali minimumdistributionsupportvectorclustering
AT xiexuping minimumdistributionsupportvectorclustering
AT yangsen minimumdistributionsupportvectorclustering
AT pangwei minimumdistributionsupportvectorclustering
AT huanglan minimumdistributionsupportvectorclustering
AT zhangshuangquan minimumdistributionsupportvectorclustering
AT zhaoshishun minimumdistributionsupportvectorclustering