Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs

About

Figure skating scoring is challenging because it requires judging the technical moves of the players as well as their coordination with the background music. Most learning-based methods cannot solve it well for two reasons: 1) each move in figure skating changes quickly, hence simply applying traditional frame sampling will lose a lot of valuable information, especially in 3 to 5 minutes long videos; 2) prior methods rarely considered the critical audio-visual relationship in their models. Due to these reasons, we introduce a novel architecture, named Skating-Mixer. It extends the MLP framework into a multimodal fashion and effectively learns long-term representations through our designed memory recurrent unit (MRU). Aside from the model, we collected a high-quality audio-visual FS1000 dataset, which contains over 1000 videos on 8 types of programs with 7 different rating metrics, overtaking other datasets in both quantity and diversity. Experiments show the proposed method achieves SOTAs over all major metrics on the public Fis-V and our FS1000 dataset. In addition, we include an analysis applying our method to the recent competitions in Beijing 2022 Winter Olympic Games, proving our method has strong applicability.

Jingfei Xia, Mingchen Zhuge, Tiantian Geng, Shun Fan, Yuantai Wei, Zhenyu He, Feng Zheng• 2022

Related benchmarks

TaskDatasetResultRank
Action Quality AssessmentFis-V
TES Spearman Correlation0.68
22
Action Quality AssessmentFis-V 2-class
Spearman Correlation ({v, f})0.732
9
Action Quality AssessmentRG 4-class
Spearman Correlation ({v, f})0.733
9
Action Quality AssessmentFS1000 7-class
Spearman Correlation ({v, f})0.722
9
Sequence-to-score reasoningFS1000 1.0 (test)
Average SRCC0.52
9
Action Quality AssessmentFS1000
TES Spearman Correlation0.88
8
Action Quality AssessmentRG
Spearman Correlation (Ball)0.677
8
Showing 7 of 7 rows

Other info

Follow for update