
COIN

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Symbolic Reasoning | Coin | Accuracy 100 | 45 |
| Procedure Planning | COIN T=3 (test) | SR 30.12 | 40 |
| Video Action Classification | COIN | Top-1 Acc 95.3 | 33 |
| Action Phase Classification | COIN | Phase Acc 54.1 | 32 |
| Action segmentation | COIN | Frame Accuracy 70.02 | 29 |
| Step Forecasting | COIN | Accuracy 56.2 | 26 |
| Classification of Procedural Activities | COIN (test) | Accuracy 90.81 | 23 |
| Action Segmentation | COIN (test) | Frame Accuracy 72.8 | 23 |
| Visual Planning | COIN | Success Rate (SR) 33.99 | 22 |
| Task recognition | COIN | Accuracy 94.5 | 22 |
| Continual Multimodal Instruction Tuning | CoIN ScienceQA TextVQA ImageNet GQA VizWiz Grounding Chameleon backbone | Accuracy 68.71 | 22 |
| Procedure Planning | COIN T=4 (test) | SR 31.56 | 21 |
| Goal-conditioned visual planning | COIN T=4 71 | SR 27.79 | 20 |
| Goal-conditioned visual planning | COIN T=3 71 | Success Rate (SR) 34.85 | 20 |
| Continual Learning | CoIN | Backward Transfer (BWT) -4.67 | 20 |
| Video Classification | COIN (test) | Top-1 Accuracy 94.1 | 20 |
| Keystep recognition | COIN (test) | Accuracy 16.9 | 18 |
| Long-Term Video Understanding | COIN | Top-1 Acc 96 | 14 |
| Keystep recognition | COIN | Accuracy 57.2 | 14 |
| Goal-conditioned visual planning | COIN T=4 71 (test) | Success Rate (SR) 33.29 | 13 |
| Goal-conditioned visual planning | COIN T=3 71 (test) | SR 45.29 | 13 |
| Consistent Video Retrieval | COIN (test) | Accuracy 51.64 | 13 |
| Video Question Answering | COIN | Accuracy 97.8 | 13 |
| Next forecasting | COIN (test) | Top-1 Accuracy 54.1 | 13 |
| Step recognition | COIN | Top-1 Accuracy 67.3 | 12 |
Showing 25 of 57 rows