Cargando…

Scene Classification for Sports Video Summarization Using Transfer Learning

This paper proposes a novel method for sports video scene classification with the particular intention of video summarization. Creating and publishing a shorter version of the video is more interesting than a full version due to instant entertainment. Generating shorter summaries of the videos is a...

Descripción completa

Detalles Bibliográficos
Autores principales: Rafiq, Muhammad, Rafiq, Ghazala, Agyeman, Rockson, Choi, Gyu Sang, Jin, Seong-Il
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146586/
https://www.ncbi.nlm.nih.gov/pubmed/32197502
http://dx.doi.org/10.3390/s20061702
_version_ 1783520235456823296
author Rafiq, Muhammad
Rafiq, Ghazala
Agyeman, Rockson
Choi, Gyu Sang
Jin, Seong-Il
author_facet Rafiq, Muhammad
Rafiq, Ghazala
Agyeman, Rockson
Choi, Gyu Sang
Jin, Seong-Il
author_sort Rafiq, Muhammad
collection PubMed
description This paper proposes a novel method for sports video scene classification with the particular intention of video summarization. Creating and publishing a shorter version of the video is more interesting than a full version due to instant entertainment. Generating shorter summaries of the videos is a tedious task that requires significant labor hours and unnecessary machine occupation. Due to the growing demand for video summarization in marketing, advertising agencies, awareness videos, documentaries, and other interest groups, researchers are continuously proposing automation frameworks and novel schemes. Since the scene classification is a fundamental component of video summarization and video analysis, the quality of scene classification is particularly important. This article focuses on various practical implementation gaps over the existing techniques and presents a method to achieve high-quality of scene classification. We consider cricket as a case study and classify five scene categories, i.e., batting, bowling, boundary, crowd and close-up. We employ our model using pre-trained AlexNet Convolutional Neural Network (CNN) for scene classification. The proposed method employs new, fully connected layers in an encoder fashion. We employ data augmentation to achieve a high accuracy of 99.26% over a smaller dataset. We conduct a performance comparison against baseline approaches to prove the superiority of the method as well as state-of-the-art models. We evaluate our performance results on cricket videos and compare various deep-learning models, i.e., Inception V3, Visual Geometry Group (VGGNet16, VGGNet19), Residual Network (ResNet50), and AlexNet. Our experiments demonstrate that our method with AlexNet CNN produces better results than existing proposals.
format Online
Article
Text
id pubmed-7146586
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71465862020-04-20 Scene Classification for Sports Video Summarization Using Transfer Learning Rafiq, Muhammad Rafiq, Ghazala Agyeman, Rockson Choi, Gyu Sang Jin, Seong-Il Sensors (Basel) Article This paper proposes a novel method for sports video scene classification with the particular intention of video summarization. Creating and publishing a shorter version of the video is more interesting than a full version due to instant entertainment. Generating shorter summaries of the videos is a tedious task that requires significant labor hours and unnecessary machine occupation. Due to the growing demand for video summarization in marketing, advertising agencies, awareness videos, documentaries, and other interest groups, researchers are continuously proposing automation frameworks and novel schemes. Since the scene classification is a fundamental component of video summarization and video analysis, the quality of scene classification is particularly important. This article focuses on various practical implementation gaps over the existing techniques and presents a method to achieve high-quality of scene classification. We consider cricket as a case study and classify five scene categories, i.e., batting, bowling, boundary, crowd and close-up. We employ our model using pre-trained AlexNet Convolutional Neural Network (CNN) for scene classification. The proposed method employs new, fully connected layers in an encoder fashion. We employ data augmentation to achieve a high accuracy of 99.26% over a smaller dataset. We conduct a performance comparison against baseline approaches to prove the superiority of the method as well as state-of-the-art models. We evaluate our performance results on cricket videos and compare various deep-learning models, i.e., Inception V3, Visual Geometry Group (VGGNet16, VGGNet19), Residual Network (ResNet50), and AlexNet. Our experiments demonstrate that our method with AlexNet CNN produces better results than existing proposals. MDPI 2020-03-18 /pmc/articles/PMC7146586/ /pubmed/32197502 http://dx.doi.org/10.3390/s20061702 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Rafiq, Muhammad
Rafiq, Ghazala
Agyeman, Rockson
Choi, Gyu Sang
Jin, Seong-Il
Scene Classification for Sports Video Summarization Using Transfer Learning
title Scene Classification for Sports Video Summarization Using Transfer Learning
title_full Scene Classification for Sports Video Summarization Using Transfer Learning
title_fullStr Scene Classification for Sports Video Summarization Using Transfer Learning
title_full_unstemmed Scene Classification for Sports Video Summarization Using Transfer Learning
title_short Scene Classification for Sports Video Summarization Using Transfer Learning
title_sort scene classification for sports video summarization using transfer learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146586/
https://www.ncbi.nlm.nih.gov/pubmed/32197502
http://dx.doi.org/10.3390/s20061702
work_keys_str_mv AT rafiqmuhammad sceneclassificationforsportsvideosummarizationusingtransferlearning
AT rafiqghazala sceneclassificationforsportsvideosummarizationusingtransferlearning
AT agyemanrockson sceneclassificationforsportsvideosummarizationusingtransferlearning
AT choigyusang sceneclassificationforsportsvideosummarizationusingtransferlearning
AT jinseongil sceneclassificationforsportsvideosummarizationusingtransferlearning