Share your thoughts, 1 month free Claude Pro on usSee more

Logical Deduction Five Objects on Big-Bench Hard (test)

52.33Accuracy

EvoPrompt(DE)-OPTS(TS)

Updated 4mo ago

Evaluation Results

Method	Links
EvoPrompt(DE)-OPTS(TS) 2025.03		52.33
EvoPrompt(GA)-OPTS(TS) 2025.03		48.17
EvoPrompt(DE) 2025.03		2.67
EvoPrompt(GA) 2025.03		1.67