Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Active Visual Search on SAT (real)
Loading...
77.33
Accuracy
SFT + RL (Ours)
56.53
61.93
67.33
72.73
Dec 15, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
SFT + RL (Ours)
Backbone=Qwen2.5-VL-7B...
2025.12
77.33
SFT
Backbone=Qwen2.5-VL-7B...
2025.12
67.33
Qwen2.5-VL-7B
Backbone=Qwen2.5-VL-7B
2025.12
60
RL
Backbone=Qwen2.5-VL-7B...
2025.12
57.33
Feedback
Search any
task
Search any
task