Cargando…
Scene Classification for Sports Video Summarization Using Transfer Learning
This paper proposes a novel method for sports video scene classification with the particular intention of video summarization. Creating and publishing a shorter version of the video is more interesting than a full version due to instant entertainment. Generating shorter summaries of the videos is a...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146586/ https://www.ncbi.nlm.nih.gov/pubmed/32197502 http://dx.doi.org/10.3390/s20061702 |
_version_ | 1783520235456823296 |
---|---|
author | Rafiq, Muhammad Rafiq, Ghazala Agyeman, Rockson Choi, Gyu Sang Jin, Seong-Il |
author_facet | Rafiq, Muhammad Rafiq, Ghazala Agyeman, Rockson Choi, Gyu Sang Jin, Seong-Il |
author_sort | Rafiq, Muhammad |
collection | PubMed |
description | This paper proposes a novel method for sports video scene classification with the particular intention of video summarization. Creating and publishing a shorter version of the video is more interesting than a full version due to instant entertainment. Generating shorter summaries of the videos is a tedious task that requires significant labor hours and unnecessary machine occupation. Due to the growing demand for video summarization in marketing, advertising agencies, awareness videos, documentaries, and other interest groups, researchers are continuously proposing automation frameworks and novel schemes. Since the scene classification is a fundamental component of video summarization and video analysis, the quality of scene classification is particularly important. This article focuses on various practical implementation gaps over the existing techniques and presents a method to achieve high-quality of scene classification. We consider cricket as a case study and classify five scene categories, i.e., batting, bowling, boundary, crowd and close-up. We employ our model using pre-trained AlexNet Convolutional Neural Network (CNN) for scene classification. The proposed method employs new, fully connected layers in an encoder fashion. We employ data augmentation to achieve a high accuracy of 99.26% over a smaller dataset. We conduct a performance comparison against baseline approaches to prove the superiority of the method as well as state-of-the-art models. We evaluate our performance results on cricket videos and compare various deep-learning models, i.e., Inception V3, Visual Geometry Group (VGGNet16, VGGNet19), Residual Network (ResNet50), and AlexNet. Our experiments demonstrate that our method with AlexNet CNN produces better results than existing proposals. |
format | Online Article Text |
id | pubmed-7146586 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-71465862020-04-20 Scene Classification for Sports Video Summarization Using Transfer Learning Rafiq, Muhammad Rafiq, Ghazala Agyeman, Rockson Choi, Gyu Sang Jin, Seong-Il Sensors (Basel) Article This paper proposes a novel method for sports video scene classification with the particular intention of video summarization. Creating and publishing a shorter version of the video is more interesting than a full version due to instant entertainment. Generating shorter summaries of the videos is a tedious task that requires significant labor hours and unnecessary machine occupation. Due to the growing demand for video summarization in marketing, advertising agencies, awareness videos, documentaries, and other interest groups, researchers are continuously proposing automation frameworks and novel schemes. Since the scene classification is a fundamental component of video summarization and video analysis, the quality of scene classification is particularly important. This article focuses on various practical implementation gaps over the existing techniques and presents a method to achieve high-quality of scene classification. We consider cricket as a case study and classify five scene categories, i.e., batting, bowling, boundary, crowd and close-up. We employ our model using pre-trained AlexNet Convolutional Neural Network (CNN) for scene classification. The proposed method employs new, fully connected layers in an encoder fashion. We employ data augmentation to achieve a high accuracy of 99.26% over a smaller dataset. We conduct a performance comparison against baseline approaches to prove the superiority of the method as well as state-of-the-art models. We evaluate our performance results on cricket videos and compare various deep-learning models, i.e., Inception V3, Visual Geometry Group (VGGNet16, VGGNet19), Residual Network (ResNet50), and AlexNet. Our experiments demonstrate that our method with AlexNet CNN produces better results than existing proposals. MDPI 2020-03-18 /pmc/articles/PMC7146586/ /pubmed/32197502 http://dx.doi.org/10.3390/s20061702 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Rafiq, Muhammad Rafiq, Ghazala Agyeman, Rockson Choi, Gyu Sang Jin, Seong-Il Scene Classification for Sports Video Summarization Using Transfer Learning |
title | Scene Classification for Sports Video Summarization Using Transfer Learning |
title_full | Scene Classification for Sports Video Summarization Using Transfer Learning |
title_fullStr | Scene Classification for Sports Video Summarization Using Transfer Learning |
title_full_unstemmed | Scene Classification for Sports Video Summarization Using Transfer Learning |
title_short | Scene Classification for Sports Video Summarization Using Transfer Learning |
title_sort | scene classification for sports video summarization using transfer learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146586/ https://www.ncbi.nlm.nih.gov/pubmed/32197502 http://dx.doi.org/10.3390/s20061702 |
work_keys_str_mv | AT rafiqmuhammad sceneclassificationforsportsvideosummarizationusingtransferlearning AT rafiqghazala sceneclassificationforsportsvideosummarizationusingtransferlearning AT agyemanrockson sceneclassificationforsportsvideosummarizationusingtransferlearning AT choigyusang sceneclassificationforsportsvideosummarizationusingtransferlearning AT jinseongil sceneclassificationforsportsvideosummarizationusingtransferlearning |