Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

COIN

Benchmarks

Task NameDataset NameSOTA ResultTrend
Video Action ClassificationCOIN
Top-1 Acc95.3
33
Action Phase ClassificationCOIN
Phase Acc54.1
32
Action segmentationCOIN
Frame Accuracy70.02
29
Classification of Procedural ActivitiesCOIN (test)
Accuracy90.81
23
Action SegmentationCOIN (test)
Frame Accuracy72.8
23
Continual Multimodal Instruction TuningCoIN ScienceQA TextVQA ImageNet GQA VizWiz Grounding Chameleon backbone
Accuracy68.71
22
Step ForecastingCOIN
Accuracy56.2
22
Procedure PlanningCOIN T=3 (test)
SR30.12
21
Video ClassificationCOIN (test)
Top-1 Accuracy94.1
20
Keystep recognitionCOIN (test)
Accuracy16.9
18
Task recognitionCOIN
Accuracy90.5
14
Long-Term Video UnderstandingCOIN
Top-1 Acc96
14
Keystep recognitionCOIN
Accuracy57.2
14
Video Question AnsweringCOIN
Accuracy97.8
13
Procedure PlanningCOIN T=4 (test)
SR22.24
13
Next forecastingCOIN (test)
Top-1 Accuracy54.1
13
Action RecognitionCOIN
Top-1 Acc90.4
12
Procedural Activities ClassificationCOIN
Accuracy90
12
Step recognitionCOIN (test)
Top-1 Acc66.4
11
Symbolic ReasoningCoin
Accuracy100
11
Instructional Video UnderstandingCOIN (test)
Step Recognition Top-1 Acc63.4
10
Task recognitionCOIN (test)
Top-1 Acc92.7
9
Step localizationCOIN
Accuracy59.6
8
Procedure PlanningCOIN T=5 (test)
SR16.06
8
Visual Planners for human AssistanceCOIN (test)
SR (T=3)25.5
6
Showing 25 of 36 rows