Cargando…

Survey on Exact kNN Queries over High-Dimensional Data Space

k nearest neighbours (kNN) queries are fundamental in many applications, ranging from data mining, recommendation system and Internet of Things, to Industry 4.0 framework applications. In mining, specifically, it can be used for the classification of human activities, iterative closest point registr...

Descripción completa

Detalles Bibliográficos
Autores principales: Ukey, Nimish, Yang, Zhengyi, Li, Binghao, Zhang, Guangjian, Hu, Yiheng, Zhang, Wenjie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9861271/
https://www.ncbi.nlm.nih.gov/pubmed/36679422
http://dx.doi.org/10.3390/s23020629
_version_ 1784874799643754496
author Ukey, Nimish
Yang, Zhengyi
Li, Binghao
Zhang, Guangjian
Hu, Yiheng
Zhang, Wenjie
author_facet Ukey, Nimish
Yang, Zhengyi
Li, Binghao
Zhang, Guangjian
Hu, Yiheng
Zhang, Wenjie
author_sort Ukey, Nimish
collection PubMed
description k nearest neighbours (kNN) queries are fundamental in many applications, ranging from data mining, recommendation system and Internet of Things, to Industry 4.0 framework applications. In mining, specifically, it can be used for the classification of human activities, iterative closest point registration and pattern recognition and has also been helpful for intrusion detection systems and fault detection. Due to the importance of kNN queries, many algorithms have been proposed in the literature, for both static and dynamic data. In this paper, we focus on exact kNN queries and present a comprehensive survey of exact kNN queries. In particular, we study two fundamental types of exact kNN queries: the kNN Search queries and the kNN Join queries. Our survey focuses on exact approaches over high-dimensional data space, which covers 20 kNN Search methods and 9 kNN Join methods. To the best of our knowledge, this is the first work of a comprehensive survey of exact kNN queries over high-dimensional datasets. We specifically categorise the algorithms based on indexing strategies, data and space partitioning strategies, clustering techniques and the computing paradigm. We provide useful insights for the evolution of approaches based on the various categorisation factors, as well as the possibility of further expansion. Lastly, we discuss some open challenges and future research directions.
format Online
Article
Text
id pubmed-9861271
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-98612712023-01-22 Survey on Exact kNN Queries over High-Dimensional Data Space Ukey, Nimish Yang, Zhengyi Li, Binghao Zhang, Guangjian Hu, Yiheng Zhang, Wenjie Sensors (Basel) Review k nearest neighbours (kNN) queries are fundamental in many applications, ranging from data mining, recommendation system and Internet of Things, to Industry 4.0 framework applications. In mining, specifically, it can be used for the classification of human activities, iterative closest point registration and pattern recognition and has also been helpful for intrusion detection systems and fault detection. Due to the importance of kNN queries, many algorithms have been proposed in the literature, for both static and dynamic data. In this paper, we focus on exact kNN queries and present a comprehensive survey of exact kNN queries. In particular, we study two fundamental types of exact kNN queries: the kNN Search queries and the kNN Join queries. Our survey focuses on exact approaches over high-dimensional data space, which covers 20 kNN Search methods and 9 kNN Join methods. To the best of our knowledge, this is the first work of a comprehensive survey of exact kNN queries over high-dimensional datasets. We specifically categorise the algorithms based on indexing strategies, data and space partitioning strategies, clustering techniques and the computing paradigm. We provide useful insights for the evolution of approaches based on the various categorisation factors, as well as the possibility of further expansion. Lastly, we discuss some open challenges and future research directions. MDPI 2023-01-05 /pmc/articles/PMC9861271/ /pubmed/36679422 http://dx.doi.org/10.3390/s23020629 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Ukey, Nimish
Yang, Zhengyi
Li, Binghao
Zhang, Guangjian
Hu, Yiheng
Zhang, Wenjie
Survey on Exact kNN Queries over High-Dimensional Data Space
title Survey on Exact kNN Queries over High-Dimensional Data Space
title_full Survey on Exact kNN Queries over High-Dimensional Data Space
title_fullStr Survey on Exact kNN Queries over High-Dimensional Data Space
title_full_unstemmed Survey on Exact kNN Queries over High-Dimensional Data Space
title_short Survey on Exact kNN Queries over High-Dimensional Data Space
title_sort survey on exact knn queries over high-dimensional data space
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9861271/
https://www.ncbi.nlm.nih.gov/pubmed/36679422
http://dx.doi.org/10.3390/s23020629
work_keys_str_mv AT ukeynimish surveyonexactknnqueriesoverhighdimensionaldataspace
AT yangzhengyi surveyonexactknnqueriesoverhighdimensionaldataspace
AT libinghao surveyonexactknnqueriesoverhighdimensionaldataspace
AT zhangguangjian surveyonexactknnqueriesoverhighdimensionaldataspace
AT huyiheng surveyonexactknnqueriesoverhighdimensionaldataspace
AT zhangwenjie surveyonexactknnqueriesoverhighdimensionaldataspace