Cargando…

Distance-based clustering using QUBO formulations

In computer science, clustering is a technique for grouping data. Ising machines can solve distance-based clustering problems described by quadratic unconstrained binary optimization (QUBO) formulations. A typical simple method using an Ising machine makes each cluster size equal and is not suitable...

Descripción completa

Detalles Bibliográficos
Autores principales: Matsumoto, Nasa, Hamakawa, Yohei, Tatsumura, Kosuke, Kudo, Kazue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8854424/
https://www.ncbi.nlm.nih.gov/pubmed/35177719
http://dx.doi.org/10.1038/s41598-022-06559-z
_version_ 1784653441507786752
author Matsumoto, Nasa
Hamakawa, Yohei
Tatsumura, Kosuke
Kudo, Kazue
author_facet Matsumoto, Nasa
Hamakawa, Yohei
Tatsumura, Kosuke
Kudo, Kazue
author_sort Matsumoto, Nasa
collection PubMed
description In computer science, clustering is a technique for grouping data. Ising machines can solve distance-based clustering problems described by quadratic unconstrained binary optimization (QUBO) formulations. A typical simple method using an Ising machine makes each cluster size equal and is not suitable for clustering unevenly distributed data. We propose a new clustering method that provides better performance than the simple method, especially for unevenly distributed data. The proposed method is a hybrid algorithm including an iterative process that comprises solving a discrete optimization problem with an Ising machine and calculating parameters with a general-purpose computer. To minimize the communication overhead between the Ising machine and the general-purpose computer, we employed a low-latency Ising machine implementing the simulated bifurcation algorithm with a field-programmable gate array attached to a local server. The proposed method results in clustering 200 unevenly distributed data points with a clustering score 18% higher than that of the simple method. The discrete optimization with 2000 variables is performed 100 times per iteration, and the overhead time is reduced to approximately 20% of the total execution time. These results suggest that hybrid algorithms using Ising machines can efficiently solve practical optimization problems.
format Online
Article
Text
id pubmed-8854424
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-88544242022-02-18 Distance-based clustering using QUBO formulations Matsumoto, Nasa Hamakawa, Yohei Tatsumura, Kosuke Kudo, Kazue Sci Rep Article In computer science, clustering is a technique for grouping data. Ising machines can solve distance-based clustering problems described by quadratic unconstrained binary optimization (QUBO) formulations. A typical simple method using an Ising machine makes each cluster size equal and is not suitable for clustering unevenly distributed data. We propose a new clustering method that provides better performance than the simple method, especially for unevenly distributed data. The proposed method is a hybrid algorithm including an iterative process that comprises solving a discrete optimization problem with an Ising machine and calculating parameters with a general-purpose computer. To minimize the communication overhead between the Ising machine and the general-purpose computer, we employed a low-latency Ising machine implementing the simulated bifurcation algorithm with a field-programmable gate array attached to a local server. The proposed method results in clustering 200 unevenly distributed data points with a clustering score 18% higher than that of the simple method. The discrete optimization with 2000 variables is performed 100 times per iteration, and the overhead time is reduced to approximately 20% of the total execution time. These results suggest that hybrid algorithms using Ising machines can efficiently solve practical optimization problems. Nature Publishing Group UK 2022-02-17 /pmc/articles/PMC8854424/ /pubmed/35177719 http://dx.doi.org/10.1038/s41598-022-06559-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Matsumoto, Nasa
Hamakawa, Yohei
Tatsumura, Kosuke
Kudo, Kazue
Distance-based clustering using QUBO formulations
title Distance-based clustering using QUBO formulations
title_full Distance-based clustering using QUBO formulations
title_fullStr Distance-based clustering using QUBO formulations
title_full_unstemmed Distance-based clustering using QUBO formulations
title_short Distance-based clustering using QUBO formulations
title_sort distance-based clustering using qubo formulations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8854424/
https://www.ncbi.nlm.nih.gov/pubmed/35177719
http://dx.doi.org/10.1038/s41598-022-06559-z
work_keys_str_mv AT matsumotonasa distancebasedclusteringusingquboformulations
AT hamakawayohei distancebasedclusteringusingquboformulations
AT tatsumurakosuke distancebasedclusteringusingquboformulations
AT kudokazue distancebasedclusteringusingquboformulations