
Linguistic Reasoning on BigBench Hard Disambiguation QA

55.1 Accuracy

ReElicit

Updated 2mo ago

Evaluation Results

Method	Links	Accuracy
ReElicit 2026.05		55.1
TextGrad 2026.05		53.2
OPRO 2026.05		52.4
PromptBreeder 2026.05		51.6
APE 2026.05		51.4