Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Logical Deduction Five Objects on Big-Bench Hard (test)
Loading...
52.33
Accuracy
EvoPrompt(DE)-OPTS(TS)
-0.3564
13.3218
27
40.6782
Mar 3, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
EvoPrompt(DE)-OPTS(TS)
Task-solving LLM=GPT-4...
2025.03
52.33
EvoPrompt(GA)-OPTS(TS)
Task-solving LLM=GPT-4...
2025.03
48.17
EvoPrompt(DE)
Task-solving LLM=GPT-4...
2025.03
2.67
EvoPrompt(GA)
Task-solving LLM=GPT-4...
2025.03
1.67
Feedback
Search any
task
Search any
task