Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Aggregated LLM Evaluation on Balanced Objective Aggregate Suite
Loading...
53.2
Weighted Average Score
CAMEL
49.456
50.428
51.4
52.372
Mar 9, 2026
Weighted Average Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Weighted Average Score
CAMEL
Training Objective=Bal...
2026.03
53.2
SODM
Training Objective=Bal...
2026.03
52.6
DML
Training Objective=Bal...
2026.03
51.9
Model-size agnostic
Training Objective=Bal...
2026.03
51.4
Human Designed
Training Objective=Bal...
2026.03
49.6
Feedback
Search any
task
Search any
task