Cargando…

Jersey number detection using synthetic data in a low-data regime

Player identification is an essential and complex task in sports video analysis. Different strategies have been devised over the years and identification based on jersey numbers is one of the most common approaches given its versatility and relative simplicity. However, automatic detection of jersey...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhargavi, Divya, Gholami, Sia, Pelaez Coyotl, Erika
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583843/
https://www.ncbi.nlm.nih.gov/pubmed/36277169
http://dx.doi.org/10.3389/frai.2022.988113
_version_ 1784813157581062144
author Bhargavi, Divya
Gholami, Sia
Pelaez Coyotl, Erika
author_facet Bhargavi, Divya
Gholami, Sia
Pelaez Coyotl, Erika
author_sort Bhargavi, Divya
collection PubMed
description Player identification is an essential and complex task in sports video analysis. Different strategies have been devised over the years and identification based on jersey numbers is one of the most common approaches given its versatility and relative simplicity. However, automatic detection of jersey numbers is challenging due to changing camera angles, low video resolution, small object size in wide-range shots, and transient changes in the player's posture and movement. In this paper, we present a novel approach for jersey number identification in a small, highly imbalanced dataset from the Seattle Seahawks practice videos. We generate novel synthetic datasets of different complexities to mitigate the data imbalance and scarcity in the samples. To show the effectiveness of our synthetic data generation, we use a multi-step strategy that enforces attention to a particular region of interest (player's torso), to identify jersey numbers. The solution first identifies and crops players in a frame using a person detection model, then utilizes a human pose estimation model to localize jersey numbers in the detected players, obviating the need for annotating bounding boxes for number detection. We experimented with two sets of Convolutional Neural Networks (CNNs) with different learning objectives: multi-class for two-digit number identification and multi-label for digit-wise detection to compare performance. Our experiments indicate that our novel synthetic data generation method improves the accuracy of various CNN models by 9% overall, and 18% on low frequency numbers.
format Online
Article
Text
id pubmed-9583843
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95838432022-10-21 Jersey number detection using synthetic data in a low-data regime Bhargavi, Divya Gholami, Sia Pelaez Coyotl, Erika Front Artif Intell Artificial Intelligence Player identification is an essential and complex task in sports video analysis. Different strategies have been devised over the years and identification based on jersey numbers is one of the most common approaches given its versatility and relative simplicity. However, automatic detection of jersey numbers is challenging due to changing camera angles, low video resolution, small object size in wide-range shots, and transient changes in the player's posture and movement. In this paper, we present a novel approach for jersey number identification in a small, highly imbalanced dataset from the Seattle Seahawks practice videos. We generate novel synthetic datasets of different complexities to mitigate the data imbalance and scarcity in the samples. To show the effectiveness of our synthetic data generation, we use a multi-step strategy that enforces attention to a particular region of interest (player's torso), to identify jersey numbers. The solution first identifies and crops players in a frame using a person detection model, then utilizes a human pose estimation model to localize jersey numbers in the detected players, obviating the need for annotating bounding boxes for number detection. We experimented with two sets of Convolutional Neural Networks (CNNs) with different learning objectives: multi-class for two-digit number identification and multi-label for digit-wise detection to compare performance. Our experiments indicate that our novel synthetic data generation method improves the accuracy of various CNN models by 9% overall, and 18% on low frequency numbers. Frontiers Media S.A. 2022-10-06 /pmc/articles/PMC9583843/ /pubmed/36277169 http://dx.doi.org/10.3389/frai.2022.988113 Text en Copyright © 2022 Bhargavi, Gholami and Pelaez Coyotl. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Bhargavi, Divya
Gholami, Sia
Pelaez Coyotl, Erika
Jersey number detection using synthetic data in a low-data regime
title Jersey number detection using synthetic data in a low-data regime
title_full Jersey number detection using synthetic data in a low-data regime
title_fullStr Jersey number detection using synthetic data in a low-data regime
title_full_unstemmed Jersey number detection using synthetic data in a low-data regime
title_short Jersey number detection using synthetic data in a low-data regime
title_sort jersey number detection using synthetic data in a low-data regime
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583843/
https://www.ncbi.nlm.nih.gov/pubmed/36277169
http://dx.doi.org/10.3389/frai.2022.988113
work_keys_str_mv AT bhargavidivya jerseynumberdetectionusingsyntheticdatainalowdataregime
AT gholamisia jerseynumberdetectionusingsyntheticdatainalowdataregime
AT pelaezcoyotlerika jerseynumberdetectionusingsyntheticdatainalowdataregime