Share your thoughts, 1 month free Claude Pro on usSee more

Feedback Evaluation Alignment on Vicuna Bench

0.423Kendall's Tau

TRACT

Updated 4mo ago

Evaluation Results

Method	Links
TRACT 2025.03		0.423
Prometheus-2-7B 2025.03		0.411
Mistral-7B-Instruct + RAFT 2025.03		0.401
Mistral-7B-Instruct + CE (GPT-4 CoT) 2025.03		0.38
Mistral-7B-Instruct + CE (GPT-4 Score) 2025.03		0.344
Mistral-7B-Instruct + RAIL Baseline 2025.03		0.122