Cargando…

Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors

Automatic data annotation eliminates most of the challenges we faced due to the manual methods of annotating sensor data. It significantly improves users’ experience during sensing activities since their active involvement in the labeling process is reduced. An unsupervised learning technique such a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pius Owoh, Nsikak, Mahinderjit Singh, Manmeet, Zaaba, Zarul Fitri
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6069149/ https://www.ncbi.nlm.nih.gov/pubmed/29970823 http://dx.doi.org/10.3390/s18072134

_version_	1783343434167222272
author	Pius Owoh, Nsikak Mahinderjit Singh, Manmeet Zaaba, Zarul Fitri
author_facet	Pius Owoh, Nsikak Mahinderjit Singh, Manmeet Zaaba, Zarul Fitri
author_sort	Pius Owoh, Nsikak
collection	PubMed
description	Automatic data annotation eliminates most of the challenges we faced due to the manual methods of annotating sensor data. It significantly improves users’ experience during sensing activities since their active involvement in the labeling process is reduced. An unsupervised learning technique such as clustering can be used to automatically annotate sensor data. However, the lingering issue with clustering is the validation of generated clusters. In this paper, we adopted the k-means clustering algorithm for annotating unlabeled sensor data for the purpose of detecting sensitive location information of mobile crowd sensing users. Furthermore, we proposed a cluster validation index for the k-means algorithm, which is based on Multiple Pair-Frequency. Thereafter, we trained three classifiers (Support Vector Machine, K-Nearest Neighbor, and Naïve Bayes) using cluster labels generated from the k-means clustering algorithm. The accuracy, precision, and recall of these classifiers were evaluated during the classification of “non-sensitive” and “sensitive” data from motion and location sensors. Very high accuracy scores were recorded from Support Vector Machine and K-Nearest Neighbor classifiers while a fairly high accuracy score was recorded from the Naïve Bayes classifier. With the hybridized machine learning (unsupervised and supervised) technique presented in this paper, unlabeled sensor data was automatically annotated and then classified.
format	Online Article Text
id	pubmed-6069149
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-60691492018-08-07 Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors Pius Owoh, Nsikak Mahinderjit Singh, Manmeet Zaaba, Zarul Fitri Sensors (Basel) Article Automatic data annotation eliminates most of the challenges we faced due to the manual methods of annotating sensor data. It significantly improves users’ experience during sensing activities since their active involvement in the labeling process is reduced. An unsupervised learning technique such as clustering can be used to automatically annotate sensor data. However, the lingering issue with clustering is the validation of generated clusters. In this paper, we adopted the k-means clustering algorithm for annotating unlabeled sensor data for the purpose of detecting sensitive location information of mobile crowd sensing users. Furthermore, we proposed a cluster validation index for the k-means algorithm, which is based on Multiple Pair-Frequency. Thereafter, we trained three classifiers (Support Vector Machine, K-Nearest Neighbor, and Naïve Bayes) using cluster labels generated from the k-means clustering algorithm. The accuracy, precision, and recall of these classifiers were evaluated during the classification of “non-sensitive” and “sensitive” data from motion and location sensors. Very high accuracy scores were recorded from Support Vector Machine and K-Nearest Neighbor classifiers while a fairly high accuracy score was recorded from the Naïve Bayes classifier. With the hybridized machine learning (unsupervised and supervised) technique presented in this paper, unlabeled sensor data was automatically annotated and then classified. MDPI 2018-07-03 /pmc/articles/PMC6069149/ /pubmed/29970823 http://dx.doi.org/10.3390/s18072134 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Pius Owoh, Nsikak Mahinderjit Singh, Manmeet Zaaba, Zarul Fitri Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors
title	Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors
title_full	Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors
title_fullStr	Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors
title_full_unstemmed	Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors
title_short	Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors
title_sort	automatic annotation of unlabeled data from smartphone-based motion and location sensors
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6069149/ https://www.ncbi.nlm.nih.gov/pubmed/29970823 http://dx.doi.org/10.3390/s18072134
work_keys_str_mv	AT piusowohnsikak automaticannotationofunlabeleddatafromsmartphonebasedmotionandlocationsensors AT mahinderjitsinghmanmeet automaticannotationofunlabeleddatafromsmartphonebasedmotionandlocationsensors AT zaabazarulfitri automaticannotationofunlabeleddatafromsmartphonebasedmotionandlocationsensors

Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors

Ejemplares similares