SOTA Human Evaluation on LongBench Chat and PapersWithCode

14Helpfulness Win Rate

LongReward + DPO

Updated 3mo ago

Evaluation Results

Method	Links
LongReward + DPO 2024.10		14	84	2	12	14	86	0	14	32	64	4	28	26	64	10	16	54	38	8	46