Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multiple-Choice Reasoning on Date Understanding (test)
Loading...
78.2
Accuracy
RIOT
71.648
73.349
75.05
76.751
Jun 19, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
RIOT
optimization=automatic
2025.06
78.2
Twenty-Shot CoT
optimization=manual, s...
2025.06
76.3
OPRO
optimization=automatic
2025.06
76.3
Zero-Shot CoT
optimization=manual, s...
2025.06
76.1
DSPy
optimization=automatic
2025.06
74.3
Four-Shot CoT
optimization=manual, s...
2025.06
73.2
APE
optimization=automatic
2025.06
72.8
TextGrad
optimization=automatic
2025.06
71.9
Feedback
Search any
task
Search any
task