Cargando…

An Active Learning Method Based on Variational Autoencoder and DBSCAN Clustering

Active learning is aimed to sample the most informative data from the unlabeled pool, and diverse clustering methods have been applied to it. However, the distance-based clustering methods usually cannot perform well in high dimensions and even begin to fail. In this paper, we propose a new active l...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Fang, Zhang, Tao, Liu, Ruilin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352707/
https://www.ncbi.nlm.nih.gov/pubmed/34381500
http://dx.doi.org/10.1155/2021/9952596
_version_ 1783736242254381056
author Chen, Fang
Zhang, Tao
Liu, Ruilin
author_facet Chen, Fang
Zhang, Tao
Liu, Ruilin
author_sort Chen, Fang
collection PubMed
description Active learning is aimed to sample the most informative data from the unlabeled pool, and diverse clustering methods have been applied to it. However, the distance-based clustering methods usually cannot perform well in high dimensions and even begin to fail. In this paper, we propose a new active learning method combined with variational autoencoder (VAE) and density-based spatial clustering of applications with noise (DBSCAN). It overcomes the difficulty of distance representation in high dimensions and prevents the distance concentration phenomenon from occurring in the computational learning literature with respect to high-dimensional p-norms. Finally, we compare our method with four common active learning methods and two other clustering algorithms combined with VAE on three datasets. The results demonstrate that our approach achieves competitive performance, and it is a new batch mode active learning algorithm designed for neural networks with a relatively small query batch size.
format Online
Article
Text
id pubmed-8352707
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-83527072021-08-10 An Active Learning Method Based on Variational Autoencoder and DBSCAN Clustering Chen, Fang Zhang, Tao Liu, Ruilin Comput Intell Neurosci Research Article Active learning is aimed to sample the most informative data from the unlabeled pool, and diverse clustering methods have been applied to it. However, the distance-based clustering methods usually cannot perform well in high dimensions and even begin to fail. In this paper, we propose a new active learning method combined with variational autoencoder (VAE) and density-based spatial clustering of applications with noise (DBSCAN). It overcomes the difficulty of distance representation in high dimensions and prevents the distance concentration phenomenon from occurring in the computational learning literature with respect to high-dimensional p-norms. Finally, we compare our method with four common active learning methods and two other clustering algorithms combined with VAE on three datasets. The results demonstrate that our approach achieves competitive performance, and it is a new batch mode active learning algorithm designed for neural networks with a relatively small query batch size. Hindawi 2021-07-30 /pmc/articles/PMC8352707/ /pubmed/34381500 http://dx.doi.org/10.1155/2021/9952596 Text en Copyright © 2021 Fang Chen et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Fang
Zhang, Tao
Liu, Ruilin
An Active Learning Method Based on Variational Autoencoder and DBSCAN Clustering
title An Active Learning Method Based on Variational Autoencoder and DBSCAN Clustering
title_full An Active Learning Method Based on Variational Autoencoder and DBSCAN Clustering
title_fullStr An Active Learning Method Based on Variational Autoencoder and DBSCAN Clustering
title_full_unstemmed An Active Learning Method Based on Variational Autoencoder and DBSCAN Clustering
title_short An Active Learning Method Based on Variational Autoencoder and DBSCAN Clustering
title_sort active learning method based on variational autoencoder and dbscan clustering
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352707/
https://www.ncbi.nlm.nih.gov/pubmed/34381500
http://dx.doi.org/10.1155/2021/9952596
work_keys_str_mv AT chenfang anactivelearningmethodbasedonvariationalautoencoderanddbscanclustering
AT zhangtao anactivelearningmethodbasedonvariationalautoencoderanddbscanclustering
AT liuruilin anactivelearningmethodbasedonvariationalautoencoderanddbscanclustering
AT chenfang activelearningmethodbasedonvariationalautoencoderanddbscanclustering
AT zhangtao activelearningmethodbasedonvariationalautoencoderanddbscanclustering
AT liuruilin activelearningmethodbasedonvariationalautoencoderanddbscanclustering