Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Logical Reasoning on BigBench Hard Formal Fallacies
Loading...
58.2
Accuracy
OPRO
53.728
54.889
56.05
57.211
May 18, 2026
Accuracy
Updated 14d ago
Evaluation Results
Method
Method
Links
Accuracy
OPRO
evaluations=30 prompt...
2026.05
58.2
ReElicit
evaluations=30 prompt...
2026.05
58.1
APE
evaluations=30 prompt...
2026.05
57.6
PromptBreeder
evaluations=30 prompt...
2026.05
54.6
TextGrad
evaluations=30 prompt...
2026.05
53.9
Feedback
Search any
task
Search any
task