Personalized LLM response generation on Koala (test)
[Chart: Win Rate (Reward Model). CLIPer: 88 (May 8, 2026). Selectable metrics: Win Rate (Reward Model), Win Rate (GPT-4o-mini), Average Win Rate.]
Updated 23d ago
Evaluation Results
| Method | Configuration | Date | Win Rate (Reward Model) | Win Rate (GPT-4o-mini) | Average Win Rate |
|---|---|---|---|---|---|
| CLIPer | Comparison Baseline=Va... | 2026.05 | 88 | 84.33 | 83.42 |
| CLIPer | Comparison Baseline=p-... | 2026.05 | 78.67 | 39.67 | 58 |
| CLIPer | Comparison Baseline=Di... | 2026.05 | 53.67 | 55 | 53 |