
Logical Deduction Five Objects on Big-Bench Hard (test)

Accuracy: 52.33

EvoPrompt(DE)-OPTS(TS)

Mar 3, 2025
Updated 4d ago

Evaluation Results

Date      Accuracy
2025.03   52.33
2025.03   48.17
2025.03   2.67
2025.03   1.67