Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Diagnostic Personalization on ITR
Loading...
89.4
Recall
Gemini-3.0-Pro
-3.576
20.562
44.7
68.838
Feb 3, 2026
Recall
Updated 4d ago
Evaluation Results
Method
Method
Links
Recall
Gemini-3.0-Pro
Inference Protocol=Direct
2026.02
89.4
Gemini-2.0-Flash
Inference Protocol=Direct
2026.02
66.1
Qwen3-VL-8B + CoViP (Ours)
Inference Protocol=w/ CAG
2026.02
42.8
Qwen3-VL-8B + CoViP (Ours)
Inference Protocol=Direct
2026.02
28
Qwen3-VL-8B + RePIC
Inference Protocol=w/ CAG
2026.02
27.8
Qwen3-VL-8B + RePIC
Inference Protocol=Direct
2026.02
27.2
Gemini-3.0-Pro
Inference Protocol=w/ CAG
2026.02
19
GPT-5
Inference Protocol=Direct
2026.02
18.6
GPT-4o
Inference Protocol=w/ CAG
2026.02
13.5
Gemini-2.0-Flash
Inference Protocol=w/ CAG
2026.02
12.2
GPT-5
Inference Protocol=w/ CAG
2026.02
10.5
Qwen3-VL-8B
Inference Protocol=Direct
2026.02
9.4
Qwen3-VL-30B-A3B
Inference Protocol=Direct
2026.02
8.8
GPT-4o
Inference Protocol=Direct
2026.02
8.4
Qwen3-VL-8B
Inference Protocol=w/ CAG
2026.02
6.8
Qwen3-VL-30B-A3B
Inference Protocol=w/ CAG
2026.02
0.4
Qwen3-VL-8B + RAP
Inference Protocol=w/ CAG
2026.02
0.2
Qwen3-VL-8B + RAP
Inference Protocol=Direct
2026.02
0
Feedback
Search any
task
Search any
task