Cargando…

Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning

Digital societies could be characterized by their increasing desire to express themselves and interact with others. This is being realized through digital platforms such as social media that have increasingly become convenient and inexpensive sensors compared to physical sensors in many sectors of s...

Descripción completa

Detalles Bibliográficos
Autores principales: Alomari, Ebtesam, Katib, Iyad, Albeshri, Aiiad, Yigitcanlar, Tan, Mehmood, Rashid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8123223/
https://www.ncbi.nlm.nih.gov/pubmed/33923247
http://dx.doi.org/10.3390/s21092993
_version_ 1783692839660552192
author Alomari, Ebtesam
Katib, Iyad
Albeshri, Aiiad
Yigitcanlar, Tan
Mehmood, Rashid
author_facet Alomari, Ebtesam
Katib, Iyad
Albeshri, Aiiad
Yigitcanlar, Tan
Mehmood, Rashid
author_sort Alomari, Ebtesam
collection PubMed
description Digital societies could be characterized by their increasing desire to express themselves and interact with others. This is being realized through digital platforms such as social media that have increasingly become convenient and inexpensive sensors compared to physical sensors in many sectors of smart societies. One such major sector is road transportation, which is the backbone of modern economies and costs globally 1.25 million deaths and 50 million human injuries annually. The cutting-edge on big data-enabled social media analytics for transportation-related studies is limited. This paper brings a range of technologies together to detect road traffic-related events using big data and distributed machine learning. The most specific contribution of this research is an automatic labelling method for machine learning-based traffic-related event detection from Twitter data in the Arabic language. The proposed method has been implemented in a software tool called Iktishaf+ (an Arabic word meaning discovery) that is able to detect traffic events automatically from tweets in the Arabic language using distributed machine learning over Apache Spark. The tool is built using nine components and a range of technologies including Apache Spark, Parquet, and MongoDB. Iktishaf+ uses a light stemmer for the Arabic language developed by us. We also use in this work a location extractor developed by us that allows us to extract and visualize spatio-temporal information about the detected events. The specific data used in this work comprises 33.5 million tweets collected from Saudi Arabia using the Twitter API. Using support vector machines, naïve Bayes, and logistic regression-based classifiers, we are able to detect and validate several real events in Saudi Arabia without prior knowledge, including a fire in Jeddah, rains in Makkah, and an accident in Riyadh. The findings show the effectiveness of Twitter media in detecting important events with no prior knowledge about them.
format Online
Article
Text
id pubmed-8123223
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81232232021-05-16 Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning Alomari, Ebtesam Katib, Iyad Albeshri, Aiiad Yigitcanlar, Tan Mehmood, Rashid Sensors (Basel) Article Digital societies could be characterized by their increasing desire to express themselves and interact with others. This is being realized through digital platforms such as social media that have increasingly become convenient and inexpensive sensors compared to physical sensors in many sectors of smart societies. One such major sector is road transportation, which is the backbone of modern economies and costs globally 1.25 million deaths and 50 million human injuries annually. The cutting-edge on big data-enabled social media analytics for transportation-related studies is limited. This paper brings a range of technologies together to detect road traffic-related events using big data and distributed machine learning. The most specific contribution of this research is an automatic labelling method for machine learning-based traffic-related event detection from Twitter data in the Arabic language. The proposed method has been implemented in a software tool called Iktishaf+ (an Arabic word meaning discovery) that is able to detect traffic events automatically from tweets in the Arabic language using distributed machine learning over Apache Spark. The tool is built using nine components and a range of technologies including Apache Spark, Parquet, and MongoDB. Iktishaf+ uses a light stemmer for the Arabic language developed by us. We also use in this work a location extractor developed by us that allows us to extract and visualize spatio-temporal information about the detected events. The specific data used in this work comprises 33.5 million tweets collected from Saudi Arabia using the Twitter API. Using support vector machines, naïve Bayes, and logistic regression-based classifiers, we are able to detect and validate several real events in Saudi Arabia without prior knowledge, including a fire in Jeddah, rains in Makkah, and an accident in Riyadh. The findings show the effectiveness of Twitter media in detecting important events with no prior knowledge about them. MDPI 2021-04-24 /pmc/articles/PMC8123223/ /pubmed/33923247 http://dx.doi.org/10.3390/s21092993 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Alomari, Ebtesam
Katib, Iyad
Albeshri, Aiiad
Yigitcanlar, Tan
Mehmood, Rashid
Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning
title Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning
title_full Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning
title_fullStr Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning
title_full_unstemmed Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning
title_short Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning
title_sort iktishaf+: a big data tool with automatic labeling for road traffic social sensing and event detection using distributed machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8123223/
https://www.ncbi.nlm.nih.gov/pubmed/33923247
http://dx.doi.org/10.3390/s21092993
work_keys_str_mv AT alomariebtesam iktishafabigdatatoolwithautomaticlabelingforroadtrafficsocialsensingandeventdetectionusingdistributedmachinelearning
AT katibiyad iktishafabigdatatoolwithautomaticlabelingforroadtrafficsocialsensingandeventdetectionusingdistributedmachinelearning
AT albeshriaiiad iktishafabigdatatoolwithautomaticlabelingforroadtrafficsocialsensingandeventdetectionusingdistributedmachinelearning
AT yigitcanlartan iktishafabigdatatoolwithautomaticlabelingforroadtrafficsocialsensingandeventdetectionusingdistributedmachinelearning
AT mehmoodrashid iktishafabigdatatoolwithautomaticlabelingforroadtrafficsocialsensingandeventdetectionusingdistributedmachinelearning