Event Logical Reasoning on BizFinBench v2
[Chart: Accuracy. Highlighted point: 73.34, Fine-tuning, Apr 10, 2026]
Evaluation Results
Method                       Tokens reported             Date      Accuracy
Fine-tuning                  Train Tokens (input /...    2026.04   73.34
KNN                          Inference Tokens (inpu...   2026.04   51.67
AIR                          Train Tokens (input /...    2026.04   51.67
DSPy GEPA                    Train Tokens (input /...    2026.04   46.67
Initial prompt               Inference Tokens (inpu...   2026.04   45
DSPy MIPROv2                 Train Tokens (input /...    2026.04   43.33
TextGrad                     Train Tokens (input /...    2026.04   43.33
DSPy BootstrapFewShot (#3)   Train Tokens (input /...    2026.04   35