Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Prompt Optimization benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Prompt Optimization
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
XSum
EGE
Hypervolume (HV)
0.1626
72
19d ago
Prompt Optimization Benchmark
LinGO
Accuracy
69
24
3mo ago
Logical Reasoning, Mathematical Calculation, and Knowledge Intensive tasks Average
MemAPO
Average Performance (%)
70.7
20
2mo ago
VisEval
PromptAgent
Accuracy (Easy)
0.77
10
3mo ago
DABench
PromptBreeder
Acc (Easy)
80
10
3mo ago
HotpotQA, IFBench, HoVer, PUPA, AIME, and LiveBench-Math 2018-2025 (test)
GEPA
HotpotQA Score
69
8
3mo ago
DSG-1K
CRAFT
DSGScore
0.91
7
3mo ago
P2-hard
Maestro
DSGScore
92
7
3mo ago
10-task prompt optimization suite GSM8K MMLU BBH
ReElicit
Average Win/Tie Rate
81
5
14d ago
product-gen (test)
Bayesian
Accuracy
92.2
5
1mo ago
trip-advisory (test)
COPRO-R
Accuracy
81.1
5
1mo ago
code-explain (test)
COPRO-R
Accuracy
84.2
5
1mo ago
42 LLM benchmarks Aggregate (overall)
System+Task Optimized
Average Score
67.14
5
1mo ago
GEPA Evaluation Suite Aggregate
LEVI
Aggregate Score
62.02
4
21d ago
PUPA
GEPA
Score
91.85
4
21d ago
Hover
GEPA
Score
52.33
4
21d ago
IFBench
LEVI
Score
46.33
4
21d ago
HotpotQA
LEVI
Score
63
4
21d ago
website-gen (test)
COPRO-R
Accuracy
53.5
4
1mo ago
Flickr
EDITOR
Mean CLIP Score
77.62
4
2mo ago
Dataset with human annotations (test)
LinGO (RAG)
Accuracy
69
4
3mo ago
Showing 21 of 21 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs