
COIN

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Symbolic Reasoning | Coin | Accuracy: 100 | 45 |
| Procedure Planning | COIN T=3 (test) | SR: 30.12 | 40 |
| Video Action Classification | COIN | Top-1 Acc: 95.3 | 33 |
| Action Phase Classification | COIN | Phase Acc: 54.1 | 32 |
| Action segmentation | COIN | Frame Accuracy: 70.02 | 29 |
| Step Forecasting | COIN | Accuracy: 56.2 | 26 |
| Classification of Procedural Activities | COIN (test) | Accuracy: 90.81 | 23 |
| Action Segmentation | COIN (test) | Frame Accuracy: 72.8 | 23 |
| Visual Planning | COIN | Success Rate (SR): 33.99 | 22 |
| Task recognition | COIN | Accuracy: 94.5 | 22 |
| Continual Multimodal Instruction Tuning | CoIN ScienceQA TextVQA ImageNet GQA VizWiz Grounding Chameleon backbone | Accuracy: 68.71 | 22 |
| Procedure Planning | COIN T=4 (test) | SR: 31.56 | 21 |
| Continual Learning | CoIN | Backward Transfer (BWT): -4.67 | 20 |
| Video Classification | COIN (test) | Top-1 Accuracy: 94.1 | 20 |
| Keystep recognition | COIN (test) | Accuracy: 16.9 | 18 |
| Long-Term Video Understanding | COIN | Top-1 Acc: 96 | 14 |
| Keystep recognition | COIN | Accuracy: 57.2 | 14 |
| Consistent Video Retrieval | COIN (test) | Accuracy: 51.64 | 13 |
| Video Question Answering | COIN | Accuracy: 97.8 | 13 |
| Next forecasting | COIN (test) | Top-1 Accuracy: 54.1 | 13 |
| Step recognition | COIN | Top-1 Accuracy: 67.3 | 12 |
| Action Recognition | COIN | Top-1 Acc: 90.4 | 12 |
| Procedural Activities Classification | COIN | Accuracy: 90 | 12 |
| Step recognition | COIN (test) | Top-1 Acc: 66.4 | 11 |
| Instructional Video Understanding | COIN (test) | Step Recognition Top-1 Acc: 63.4 | 10 |
Showing 25 of 48 rows