Cargando…

Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models

Human detection in videos plays an important role in various real life applications. Most of traditional approaches depend on utilizing handcrafted features which are problem-dependent and optimal for specific tasks. Moreover, they are highly susceptible to dynamical events such as illumination chan...

Descripción completa

Detalles Bibliográficos
Autores principales: AlDahoul, Nouar, Md Sabri, Aznul Qalid, Mansoor, Ali Mohammed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5829342/
https://www.ncbi.nlm.nih.gov/pubmed/29623089
http://dx.doi.org/10.1155/2018/1639561
_version_ 1783302786266431488
author AlDahoul, Nouar
Md Sabri, Aznul Qalid
Mansoor, Ali Mohammed
author_facet AlDahoul, Nouar
Md Sabri, Aznul Qalid
Mansoor, Ali Mohammed
author_sort AlDahoul, Nouar
collection PubMed
description Human detection in videos plays an important role in various real life applications. Most of traditional approaches depend on utilizing handcrafted features which are problem-dependent and optimal for specific tasks. Moreover, they are highly susceptible to dynamical events such as illumination changes, camera jitter, and variations in object sizes. On the other hand, the proposed feature learning approaches are cheaper and easier because highly abstract and discriminative features can be produced automatically without the need of expert knowledge. In this paper, we utilize automatic feature learning methods which combine optical flow and three different deep models (i.e., supervised convolutional neural network (S-CNN), pretrained CNN feature extractor, and hierarchical extreme learning machine) for human detection in videos captured using a nonstatic camera on an aerial platform with varying altitudes. The models are trained and tested on the publicly available and highly challenging UCF-ARG aerial dataset. The comparison between these models in terms of training, testing accuracy, and learning speed is analyzed. The performance evaluation considers five human actions (digging, waving, throwing, walking, and running). Experimental results demonstrated that the proposed methods are successful for human detection task. Pretrained CNN produces an average accuracy of 98.09%. S-CNN produces an average accuracy of 95.6% with soft-max and 91.7% with Support Vector Machines (SVM). H-ELM has an average accuracy of 95.9%. Using a normal Central Processing Unit (CPU), H-ELM's training time takes 445 seconds. Learning in S-CNN takes 770 seconds with a high performance Graphical Processing Unit (GPU).
format Online
Article
Text
id pubmed-5829342
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-58293422018-04-05 Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models AlDahoul, Nouar Md Sabri, Aznul Qalid Mansoor, Ali Mohammed Comput Intell Neurosci Research Article Human detection in videos plays an important role in various real life applications. Most of traditional approaches depend on utilizing handcrafted features which are problem-dependent and optimal for specific tasks. Moreover, they are highly susceptible to dynamical events such as illumination changes, camera jitter, and variations in object sizes. On the other hand, the proposed feature learning approaches are cheaper and easier because highly abstract and discriminative features can be produced automatically without the need of expert knowledge. In this paper, we utilize automatic feature learning methods which combine optical flow and three different deep models (i.e., supervised convolutional neural network (S-CNN), pretrained CNN feature extractor, and hierarchical extreme learning machine) for human detection in videos captured using a nonstatic camera on an aerial platform with varying altitudes. The models are trained and tested on the publicly available and highly challenging UCF-ARG aerial dataset. The comparison between these models in terms of training, testing accuracy, and learning speed is analyzed. The performance evaluation considers five human actions (digging, waving, throwing, walking, and running). Experimental results demonstrated that the proposed methods are successful for human detection task. Pretrained CNN produces an average accuracy of 98.09%. S-CNN produces an average accuracy of 95.6% with soft-max and 91.7% with Support Vector Machines (SVM). H-ELM has an average accuracy of 95.9%. Using a normal Central Processing Unit (CPU), H-ELM's training time takes 445 seconds. Learning in S-CNN takes 770 seconds with a high performance Graphical Processing Unit (GPU). Hindawi 2018-02-12 /pmc/articles/PMC5829342/ /pubmed/29623089 http://dx.doi.org/10.1155/2018/1639561 Text en Copyright © 2018 Nouar AlDahoul et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
AlDahoul, Nouar
Md Sabri, Aznul Qalid
Mansoor, Ali Mohammed
Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
title Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
title_full Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
title_fullStr Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
title_full_unstemmed Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
title_short Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
title_sort real-time human detection for aerial captured video sequences via deep models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5829342/
https://www.ncbi.nlm.nih.gov/pubmed/29623089
http://dx.doi.org/10.1155/2018/1639561
work_keys_str_mv AT aldahoulnouar realtimehumandetectionforaerialcapturedvideosequencesviadeepmodels
AT mdsabriaznulqalid realtimehumandetectionforaerialcapturedvideosequencesviadeepmodels
AT mansooralimohammed realtimehumandetectionforaerialcapturedvideosequencesviadeepmodels