The Construction of Mathematical Model of Swimmers’ Technical Movements using Multimodal Deep Learning Framework

Authors

  • Mengmeng Wang School of Physical Education, Jiangsu University of Technology, Changzhou 213001, Jiangsu, China
  • Yangwen He Department of Orthopedics, Changzhou West the Taihu Lake Hospital, Changzhou 213149, Jiangsu, China

DOI:

https://doi.org/10.12694/scpe.v26i3.4171

Keywords:

Multimodal deep learning, Space-time module, Translation part of the channel, NetVLAD, Classification o f swimmers

Abstract

This paper proposes a mathematical model construction method based on a multi-modal deep learning framework aiming at the accuracy and real-time requirements of swimmers’ technical movement analysis. The model can extract the image features and timing information of athletes’ movements from video sequences by integrating spatiotemporal modules. This paper introduces the translation partial channel strategy to overcome the limitation of spatiotemporal information separation in traditional methods, which can seamlessly integrate spatiotemporal features and enhance the recognition ability of complex action patterns. In addition, NetVLAD is used as the feature aggregation layer. This layer can capture and encode the global and local features of the athlete’s movements, thereby improving the classifier’s performance. In the experimental part, the model is strictly verified, and the results show that compared with the prior art, the model in this paper shows higher accuracy and faster processing speed in the swimmer’s action classification task. This provides the possibility of immediate feedback for coaches and athletes and lays a solid foundation for further research in the field of sports science.

Downloads

Published

2025-04-01

Issue

Section

Speciai Issue - Deep Learning in Healthcare