Financial Reasoning on OPENFIN (held-out)
[Chart: DBS; best result 73.4 (EXPERT), Feb 11, 2026. Metrics shown: DBS, DVSavg@32.]
Evaluation Results
| Method | Configuration | Date | DBS | DVSavg@32 |
| --- | --- | --- | --- | --- |
| EXPERT | Model=Qwen3-1.7B optim... | 2026.02 | 73.4 | - |
| DataChef-32B | Oracle Upper Bound=tru... | 2026.02 | 67.1 | - |
| Qwen3-Next ⊕ Kimi-K2 | Reasoning backbone=Qwe... | 2026.02 | 64 | 54.7 |
| DataChef-32B | | 2026.02 | 63.9 | 67 |
| SOURCEbest | | 2026.02 | 63.7 | - |
| Gemini-3-Pro | | 2026.02 | 61.8 | 54.9 |
| Kimi-K2 | | 2026.02 | 46.5 | 51.5 |
| SOURCEavg | | 2026.02 | 41.7 | - |
| Qwen3-32B | | 2026.02 | 23.8 | 34.9 |