Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA Feedback Evaluation Alignment benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
Feedback Evaluation Alignment
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
MT Bench
TRACT
Kendall's Tau
0.494
11
4d ago
Vicuna Bench
TRACT
Kendall's Tau
0.423
6
4d ago
FLASK
Prometheus-2-7B
Kendall's Tau
0.405
6
4d ago
Feedback Bench
Mistral-7B-Instruct + CE (GPT-4 Score)
Kendall's Tau
82.4
6
4d ago
Showing 4 of 4 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task