Feedback Evaluation Alignment on Vicuna Bench
Top result: TRACT, Kendall's Tau 0.423 (Mar 6, 2025)
Evaluation Results

Method                                  Settings                   Date     Kendall's Tau
TRACT                                   CoT=true, Training obj...  2025.03  0.423
Prometheus-2-7B                         CoT=true, Training obj...  2025.03  0.411
Mistral-7B-Instruct + RAFT              CoT=false, Training ob...  2025.03  0.401
Mistral-7B-Instruct + CE (GPT-4 CoT)    CoT=true, Training obj...  2025.03  0.38
Mistral-7B-Instruct + CE (GPT-4 Score)  CoT=false, Training ob...  2025.03  0.344
Mistral-7B-Instruct + RAIL Baseline     CoT=false, Inference m...  2025.03  0.122