Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on MathQA (Exact Match)
Loading...
52.4
Exact Match
AdapShot
-0.848
12.976
26.8
40.624
May 5, 2026
Exact Match
Updated 28d ago
Evaluation Results
Method
Method
Links
Exact Match
AdapShot
Backbone=Qwen2.5-7B
2026.05
52.4
Many-shot (256)
Backbone=Qwen2.5-7B
2026.05
48.3
Few-shot (8)
Backbone=Qwen2.5-7B
2026.05
35
DBSA
Backbone=Qwen2.5-7B
2026.05
32.3
Many-shot (512)
Backbone=Qwen2.5-7B
2026.05
23.2
Zero-shot
Backbone=Qwen2.5-7B
2026.05
1.2
Feedback
Search any
task
Search any
task