Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Knowledge-intensive QA on SQuAD 5-shot
Loading...
90.38
EM Accuracy
ADAFUSE (Top-2 Base)
70.7552
75.8501
80.945
86.0399
Jan 9, 2026
EM Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
EM Accuracy
ADAFUSE (Top-2 Base)
Base model selection s...
2026.01
90.38
ADAFUSE (Fixed Base)
Base model selection s...
2026.01
90.15
SWEETSPAN
Base model selection s...
2026.01
86.58
Mistral-7B-Instruct-v0.3
Model Type=Base Model
2026.01
83.49
UniTE
Base model selection s...
2026.01
82.17
LLM-BLENDER
Base model selection s...
2026.01
82.13
InternLM3-8B-Instruct
Model Type=Base Model
2026.01
81.77
LLaMA-3.1-8B-Instruct
Model Type=Base Model
2026.01
80.13
Qwen3-8B
Model Type=Base Model
2026.01
76.72
DEEPEN
Base model selection s...
2026.01
71.51
Feedback
Search any
task
Search any
task