Closed-book Question Answering on Ever Young
Top result: KNN, LLM Score 81.67
[Chart: LLM Score (y-axis) by date (x-axis); visible date tick: Apr 10, 2026]
Updated 5d ago
Evaluation Results
| Method | Token metric | Date | LLM Score |
| --- | --- | --- | --- |
| KNN | Inference Tokens (inpu... | 2026.04 | 81.67 |
| Fine-tuning | Train Tokens (input /... | 2026.04 | 72.08 |
| DSPy GEPA | Train Tokens (input /... | 2026.04 | 50.83 |
| Initial prompt | Inference Tokens (inpu... | 2026.04 | 42.08 |
| AIR | Train Tokens (input /... | 2026.04 | 42.08 |
| DSPy BootstrapFewShot (#10) | Train Tokens (input /... | 2026.04 | 38.3 |
| DSPy MIPROv2 | Train Tokens (input /... | 2026.04 | 37.5 |
| TextGrad | Train Tokens (input /... | 2026.04 | 37.5 |