Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
snarks on BBH (test)
Loading...
79.43
Accuracy
EvoPrompt(GA)
78.6188
78.8294
79.04
79.2506
Mar 3, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
EvoPrompt(GA)
Task-solving LLM=GPT-4...
2025.03
79.43
EvoPrompt(GA)-OPTS(TS)
Task-solving LLM=GPT-4...
2025.03
78.65
Feedback
Search any
task
Search any
task