Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Planning on NuInteract (test)
Loading...
82.64
Accuracy
DriveMonkey
34.3424
46.8812
59.42
71.9588
Dec 29, 2025
Accuracy
Updated 2d ago
Evaluation Results
Method
Method
Links
Accuracy
DriveMonkey
Years=2025, LLM=Intern...
2025.12
82.64
GaussianDWM
Years=2025, LLM=Qwen3-8B
2025.12
66.27
InternVL1.5-2B
Years=2024, LLM=Intern...
2025.12
53.96
Qwen2VL
Years=2024, LLM=QWen2-7B
2025.12
49.33
InternVL2-8B
Years=2024, LLM=Intern...
2025.12
46.93
QWen2VL
Years=2024, LLM=Qwen2-2B
2025.12
45.59
InternVL2-2B
Years=2024, LLM=Intern...
2025.12
44.61
InternVL2-1B
Years=2024, LLM=Qwen2-...
2025.12
44.08
InternVL2-4B
Years=2024, LLM=Phi3-4B
2025.12
40.43
InternVL1.5-4B
Years=2024, LLM=Phi3-4B
2025.12
40.25
MiniCPM-V 2
Years=2024, LLM=MiniCP...
2025.12
36.69
MiniCPM-V 2.6
Years=2024, LLM=QWen2-7B
2025.12
36.42
LLaVA1.5
Years=2024, LLM=Vicuna-7B
2025.12
36.2
Feedback
Search any
task
Search any
task