Task recognition on COIN
Best reported Accuracy: 94.5 (GAD)
[Chart: Accuracy over time, Jul 17, 2023 to Mar 3, 2026]
Updated 1mo ago
Evaluation Results
Method              Method details             Date     Accuracy
GAD                 Backbone=Llama3-8B-Ins...  2026.03  94.5
Disc                Backbone=Llama3-8B-Ins...  2026.03  94.3
GAD                 Backbone=Llama3.2-1B-I...  2026.03  93.5
StreamMind-8B       Backbone=8B                2026.03  93.2
Videollm-MoD-8B     Backbone=8B                2026.03  92.8
Disc                Backbone=Llama3.2-1B-I...  2026.03  92.8
Videollm-online-8B  Backbone=8B                2026.03  92.7
Ours                Downstream architectur...  2023.07  90.5
VideoTaskGraph      -                          2026.03  90.5
DistantSup.         Downstream architectur...  2023.07  90
Ours                Downstream architectur...  2023.07  89.4
TimeSformer         Downstream architectur...  2023.07  88.9
DistantSup.         Downstream architectur...  2023.07  88.2
TimeSformer         Downstream architectur...  2023.07  87
VideoCLIP           Downstream architectur...  2023.07  82.9
TSN                 Downstream architectur...  2023.07  73.4
VideoCLIP           Downstream architectur...  2023.07  72.5
SlowFast            Downstream architectur...  2023.07  72.4
SlowFast            Downstream architectur...  2023.07  71.6
S3D                 Downstream architectur...  2023.07  70.2
S3D                 Downstream architectur...  2023.07  68.5
ClipBERT            Downstream architectur...  2023.07  65.4