Feedback Evaluation Alignment on Feedback Bench
Top result: 82.4 Kendall's Tau, Mistral-7B-Instruct + CE (GPT-4 Score)
[Chart: Kendall's Tau by date; date shown: Mar 6, 2025]
Updated 4d ago
Evaluation Results
Method                                  Settings (truncated)       Date     Kendall's Tau
Mistral-7B-Instruct + CE (GPT-4 Score)  CoT=false, Training ob...  2025.03  82.4
TRACT                                   CoT=true, Training obj...  2025.03  82
Mistral-7B-Instruct + RAFT              CoT=false, Training ob...  2025.03  81.8
Mistral-7B-Instruct + CE (GPT-4 CoT)    CoT=true, Training obj...  2025.03  79.8
Prometheus-2-7B                         CoT=true, Training obj...  2025.03  76.5
Mistral-7B-Instruct + RAIL Baseline     CoT=false, Inference m...  2025.03  13
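
For readers unfamiliar with the metric, here is a minimal sketch of how Kendall's Tau (the tau-b variant, which handles ties) can be computed between reference scores and an evaluator model's scores. The function name `kendall_tau_b` and the sample scores are hypothetical, and nothing here is taken from the benchmark's own evaluation code. The leaderboard values look like tau multiplied by 100; that scaling is an assumption, not something the page states.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(x, y):
    """Kendall's tau-b between two equal-length score lists (O(n^2) over pairs)."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    concordant = discordant = ties_x_only = ties_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied in both lists: excluded from numerator and denominator
        if dx == 0:
            ties_x_only += 1
        elif dy == 0:
            ties_y_only += 1
        elif dx * dy > 0:
            concordant += 1  # both lists order this pair the same way
        else:
            discordant += 1  # the lists disagree on this pair's order
    ranked = concordant + discordant
    denom = sqrt((ranked + ties_x_only) * (ranked + ties_y_only))
    return (concordant - discordant) / denom if denom else float("nan")


# Hypothetical 1-5 feedback scores: reference judge vs. evaluator model.
reference = [1, 2, 3, 4, 5]
predicted = [1, 3, 2, 4, 5]
tau = kendall_tau_b(reference, predicted)
print(f"tau-b = {tau:.3f}")
```

With the sample lists above, nine of the ten pairs are ordered consistently and one is swapped, so the result is (9 - 1) / 10 = 0.8. For real workloads, `scipy.stats.kendalltau` computes the same statistic more efficiently.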