Cargando…

Sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on Swin Transformer and CLIP

INTRODUCTION: This paper presents an innovative Intelligent Robot Sports Competition Tactical Analysis Model that leverages multimodal perception to tackle the pressing challenge of analyzing opponent tactics in sports competitions. The current landscape of sports competition analysis necessitates a...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Li, Lu, Wang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10642548/
https://www.ncbi.nlm.nih.gov/pubmed/37965071
http://dx.doi.org/10.3389/fnbot.2023.1275645
_version_ 1785146988447137792
author Jiang, Li
Lu, Wang
author_facet Jiang, Li
Lu, Wang
author_sort Jiang, Li
collection PubMed
description INTRODUCTION: This paper presents an innovative Intelligent Robot Sports Competition Tactical Analysis Model that leverages multimodal perception to tackle the pressing challenge of analyzing opponent tactics in sports competitions. The current landscape of sports competition analysis necessitates a comprehensive understanding of opponent strategies. However, traditional methods are often constrained to a single data source or modality, limiting their ability to capture the intricate details of opponent tactics. METHODS: Our system integrates the Swin Transformer and CLIP models, harnessing cross-modal transfer learning to enable a holistic observation and analysis of opponent tactics. The Swin Transformer is employed to acquire knowledge about opponent action postures and behavioral patterns in basketball or football games, while the CLIP model enhances the system's comprehension of opponent tactical information by establishing semantic associations between images and text. To address potential imbalances and biases between these models, we introduce a cross-modal transfer learning technique that mitigates modal bias issues, thereby enhancing the model's generalization performance on multimodal data. RESULTS: Through cross-modal transfer learning, tactical information learned from images by the Swin Transformer is effectively transferred to the CLIP model, providing coaches and athletes with comprehensive tactical insights. Our method is rigorously tested and validated using Sport UV, Sports-1M, HMDB51, and NPU RGB+D datasets. Experimental results demonstrate the system's impressive performance in terms of prediction accuracy, stability, training time, inference time, number of parameters, and computational complexity. Notably, the system outperforms other models, with a remarkable 8.47% lower prediction error (MAE) on the Kinetics dataset, accompanied by a 72.86-second reduction in training time. DISCUSSION: The presented system proves to be highly suitable for real-time sports competition assistance and analysis, offering a novel and effective approach for an Intelligent Robot Sports Competition Tactical Analysis Model that maximizes the potential of multimodal perception technology. By harnessing the synergies between the Swin Transformer and CLIP models, we address the limitations of traditional methods and significantly advance the field of sports competition analysis. This innovative model opens up new avenues for comprehensive tactical analysis in sports, benefiting coaches, athletes, and sports enthusiasts alike.
format Online
Article
Text
id pubmed-10642548
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-106425482023-11-14 Sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on Swin Transformer and CLIP Jiang, Li Lu, Wang Front Neurorobot Neuroscience INTRODUCTION: This paper presents an innovative Intelligent Robot Sports Competition Tactical Analysis Model that leverages multimodal perception to tackle the pressing challenge of analyzing opponent tactics in sports competitions. The current landscape of sports competition analysis necessitates a comprehensive understanding of opponent strategies. However, traditional methods are often constrained to a single data source or modality, limiting their ability to capture the intricate details of opponent tactics. METHODS: Our system integrates the Swin Transformer and CLIP models, harnessing cross-modal transfer learning to enable a holistic observation and analysis of opponent tactics. The Swin Transformer is employed to acquire knowledge about opponent action postures and behavioral patterns in basketball or football games, while the CLIP model enhances the system's comprehension of opponent tactical information by establishing semantic associations between images and text. To address potential imbalances and biases between these models, we introduce a cross-modal transfer learning technique that mitigates modal bias issues, thereby enhancing the model's generalization performance on multimodal data. RESULTS: Through cross-modal transfer learning, tactical information learned from images by the Swin Transformer is effectively transferred to the CLIP model, providing coaches and athletes with comprehensive tactical insights. Our method is rigorously tested and validated using Sport UV, Sports-1M, HMDB51, and NPU RGB+D datasets. Experimental results demonstrate the system's impressive performance in terms of prediction accuracy, stability, training time, inference time, number of parameters, and computational complexity. Notably, the system outperforms other models, with a remarkable 8.47% lower prediction error (MAE) on the Kinetics dataset, accompanied by a 72.86-second reduction in training time. DISCUSSION: The presented system proves to be highly suitable for real-time sports competition assistance and analysis, offering a novel and effective approach for an Intelligent Robot Sports Competition Tactical Analysis Model that maximizes the potential of multimodal perception technology. By harnessing the synergies between the Swin Transformer and CLIP models, we address the limitations of traditional methods and significantly advance the field of sports competition analysis. This innovative model opens up new avenues for comprehensive tactical analysis in sports, benefiting coaches, athletes, and sports enthusiasts alike. Frontiers Media S.A. 2023-10-30 /pmc/articles/PMC10642548/ /pubmed/37965071 http://dx.doi.org/10.3389/fnbot.2023.1275645 Text en Copyright © 2023 Jiang and Lu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Jiang, Li
Lu, Wang
Sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on Swin Transformer and CLIP
title Sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on Swin Transformer and CLIP
title_full Sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on Swin Transformer and CLIP
title_fullStr Sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on Swin Transformer and CLIP
title_full_unstemmed Sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on Swin Transformer and CLIP
title_short Sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on Swin Transformer and CLIP
title_sort sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on swin transformer and clip
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10642548/
https://www.ncbi.nlm.nih.gov/pubmed/37965071
http://dx.doi.org/10.3389/fnbot.2023.1275645
work_keys_str_mv AT jiangli sportscompetitiontacticalanalysismodelofcrossmodaltransferlearningintelligentrobotbasedonswintransformerandclip
AT luwang sportscompetitiontacticalanalysismodelofcrossmodaltransferlearningintelligentrobotbasedonswintransformerandclip