Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-fidelity Bandit Optimization on LLM-as-a-judge residual-mismatch Λ=128000 (test)
Loading...
4,023.4
Mean Cost-Weighted Pseudo-Regret
TACC
3,969.972
4,330.611
4,691.25
5,051.889
May 8, 2026
Mean Cost-Weighted Pseudo-Regret
Standard Error (Regret)
Updated 22d ago
Evaluation Results
Method
Method
Links
Mean Cost-Weighted Pseudo-Regret
Standard Error (Regret)
TACC
lambda_H (high-fidelit...
2026.05
4,023.4
247.3
UCB
lambda_H (high-fidelit...
2026.05
5,083.2
286.8
DNC
lambda_H (high-fidelit...
2026.05
5,201
289.7
MF-UCB
lambda_H (high-fidelit...
2026.05
5,359.1
281.8
Feedback
Search any
task
Search any
task