Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
tracking shuffled objects seven objects on BBH (test)
Loading...
88.33
Accuracy
EvoPrompt(GA)-OPTS(TS)
75.1636
78.5818
82
85.4182
Mar 3, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
EvoPrompt(GA)-OPTS(TS)
Task-solving LLM=GPT-4...
2025.03
88.33
EvoPrompt(DE)-OPTS(TS)
Task-solving LLM=GPT-4...
2025.03
81.83
EvoPrompt(DE)
Task-solving LLM=GPT-4...
2025.03
80.67
EvoPrompt(GA)
Task-solving LLM=GPT-4...
2025.03
75.67
Feedback
Search any
task
Search any
task