Cargando…
Minimum Distribution Support Vector Clustering
Support vector clustering (SVC) is a boundary-based algorithm, which has several advantages over other clustering methods, including identifying clusters of arbitrary shapes and numbers. Leveraged by the high generalization ability of the large margin distribution machine (LDM) and the optimal margi...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623761/ https://www.ncbi.nlm.nih.gov/pubmed/34828171 http://dx.doi.org/10.3390/e23111473 |
_version_ | 1784606009629605888 |
---|---|
author | Wang, Yan Chen, Jiali Xie, Xuping Yang, Sen Pang, Wei Huang, Lan Zhang, Shuangquan Zhao, Shishun |
author_facet | Wang, Yan Chen, Jiali Xie, Xuping Yang, Sen Pang, Wei Huang, Lan Zhang, Shuangquan Zhao, Shishun |
author_sort | Wang, Yan |
collection | PubMed |
description | Support vector clustering (SVC) is a boundary-based algorithm, which has several advantages over other clustering methods, including identifying clusters of arbitrary shapes and numbers. Leveraged by the high generalization ability of the large margin distribution machine (LDM) and the optimal margin distribution clustering (ODMC), we propose a new clustering method: minimum distribution for support vector clustering (MDSVC), for improving the robustness of boundary point recognition, which characterizes the optimal hypersphere by the first-order and second-order statistics and tries to minimize the mean and variance simultaneously. In addition, we further prove, theoretically, that our algorithm can obtain better generalization performance. Some instructive insights for adjusting the number of support vector points are gained. For the optimization problem of MDSVC, we propose a double coordinate descent algorithm for small and medium samples. The experimental results on both artificial and real datasets indicate that our MDSVC has a significant improvement in generalization performance compared to SVC. |
format | Online Article Text |
id | pubmed-8623761 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-86237612021-11-27 Minimum Distribution Support Vector Clustering Wang, Yan Chen, Jiali Xie, Xuping Yang, Sen Pang, Wei Huang, Lan Zhang, Shuangquan Zhao, Shishun Entropy (Basel) Article Support vector clustering (SVC) is a boundary-based algorithm, which has several advantages over other clustering methods, including identifying clusters of arbitrary shapes and numbers. Leveraged by the high generalization ability of the large margin distribution machine (LDM) and the optimal margin distribution clustering (ODMC), we propose a new clustering method: minimum distribution for support vector clustering (MDSVC), for improving the robustness of boundary point recognition, which characterizes the optimal hypersphere by the first-order and second-order statistics and tries to minimize the mean and variance simultaneously. In addition, we further prove, theoretically, that our algorithm can obtain better generalization performance. Some instructive insights for adjusting the number of support vector points are gained. For the optimization problem of MDSVC, we propose a double coordinate descent algorithm for small and medium samples. The experimental results on both artificial and real datasets indicate that our MDSVC has a significant improvement in generalization performance compared to SVC. MDPI 2021-11-08 /pmc/articles/PMC8623761/ /pubmed/34828171 http://dx.doi.org/10.3390/e23111473 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Yan Chen, Jiali Xie, Xuping Yang, Sen Pang, Wei Huang, Lan Zhang, Shuangquan Zhao, Shishun Minimum Distribution Support Vector Clustering |
title | Minimum Distribution Support Vector Clustering |
title_full | Minimum Distribution Support Vector Clustering |
title_fullStr | Minimum Distribution Support Vector Clustering |
title_full_unstemmed | Minimum Distribution Support Vector Clustering |
title_short | Minimum Distribution Support Vector Clustering |
title_sort | minimum distribution support vector clustering |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623761/ https://www.ncbi.nlm.nih.gov/pubmed/34828171 http://dx.doi.org/10.3390/e23111473 |
work_keys_str_mv | AT wangyan minimumdistributionsupportvectorclustering AT chenjiali minimumdistributionsupportvectorclustering AT xiexuping minimumdistributionsupportvectorclustering AT yangsen minimumdistributionsupportvectorclustering AT pangwei minimumdistributionsupportvectorclustering AT huanglan minimumdistributionsupportvectorclustering AT zhangshuangquan minimumdistributionsupportvectorclustering AT zhaoshishun minimumdistributionsupportvectorclustering |