Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Feedback Evaluation Alignment on Vicuna Bench
Loading...
0.423
Kendall's Tau
TRACT
0.10996
0.19123
0.2725
0.35377
Mar 6, 2025
Kendall's Tau
Updated 1mo ago
Evaluation Results
Method
Method
Links
Kendall's Tau
TRACT
CoT=true, Training obj...
2025.03
0.423
Prometheus-2-7B
CoT=true, Training obj...
2025.03
0.411
Mistral-7B-Instruct + RAFT
CoT=false, Training ob...
2025.03
0.401
Mistral-7B-Instruct + CE (GPT-4 CoT)
CoT=true, Training obj...
2025.03
0.38
Mistral-7B-Instruct + CE (GPT-4 Score)
CoT=false, Training ob...
2025.03
0.344
Mistral-7B-Instruct + RAIL Baseline
CoT=false, Inference m...
2025.03
0.122
Feedback
Search any
task
Search any
task