Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment

About

Can performance on the task of action quality assessment (AQA) be improved by exploiting a description of the action and its quality? Current AQA and skills assessment approaches propose to learn features that serve only one task - estimating the final score. In this paper, we propose to learn spatio-temporal features that explain three related tasks - fine-grained action recognition, commentary generation, and estimating the AQA score. A new multitask-AQA dataset, the largest to date, comprising of 1412 diving samples was collected to evaluate our approach (https://github.com/ParitoshParmar/MTL-AQA). We show that our MTL approach outperforms STL approach using two different kinds of architectures: C3D-AVG and MSCADC. The C3D-AVG-MTL approach achieves the new state-of-the-art performance with a rank correlation of 90.44%. Detailed experiments were performed to show that MTL offers better generalization than STL, and representations from action recognition models are not sufficient for the AQA task and instead should be learned.

Paritosh Parmar, Brendan Tran Morris• 2019

Related benchmarks

TaskDatasetResultRank
Action Quality AssessmentMTL-AQA (test)
Spearman Correlation89.6
29
Action Quality AssessmentMTL-AQA
Spearman Correlation0.9044
22
Action Quality AssessmentFineDiving-HM (test)
Spearman Correlation0.8371
9
Action Quality AssessmentMTL-AQA v1 (test)
Spearman Correlation90.44
7
Action Quality AssessmentMTL-NAE
Spearman's Rho0.904
7
Action Quality AssessmentFineGym NAE
Spearman Correlation (ρ)0.701
7
Narrative Action EvaluationMTL-NAE re-annotated (test)
mAP15.7
7
Narrative Action EvaluationFineGym NAE re-annotated (test)
mAP5.1
7
Action Quality AssessmentUNLV Dive 16 (test)
Spearman Correlation88.08
6
Dive RecognitionMTL-AQA (test)
Position Accuracy96.32
6
Showing 10 of 12 rows

Other info

Code

Follow for update