Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Human Evaluation on LongBench Chat

14Helpfulness Win Rate

LongReward + DPO

13.313.651414.35Oct 28, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.10
148421214860143264428266410165438846