Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Atomic Action Recognition on SceneTeract (test)
Loading...
77.1
Accuracy
Gemini-3-Flash-Preview
32.796
44.298
55.8
67.302
Mar 31, 2026
Accuracy
Updated 18d ago
Evaluation Results
Method
Method
Links
Accuracy
Gemini-3-Flash-Preview
Evaluation Setting=Dec...
2026.03
77.1
Qwen3-VL-4B-Instruct with GRPO
Evaluation Setting=Dec...
2026.03
75.3
Gemma3-12B-Instruct
Evaluation Setting=Dec...
2026.03
75
Gemma3-4B-Instruct
Evaluation Setting=Dec...
2026.03
74.8
Claude-Sonnet-4-6
Evaluation Setting=Dec...
2026.03
73.1
Gemini-3.1-Pro-Preview
Evaluation Setting=Dec...
2026.03
72.1
Qwen3-VL-4B-Instruct
Evaluation Setting=Dec...
2026.03
70.4
Qwen3-VL-8B-Instruct
Evaluation Setting=Dec...
2026.03
67.5
Ministral3-3B-Instruct
Evaluation Setting=Dec...
2026.03
34.5
Feedback
Search any
task
Search any
task