Share your thoughts, 1 month free Claude Pro on usSee more

Long-context Language Modeling on LongBench (MultiFieldQA, MuSiQue, GovReport 2023 test)

32.18MultiFieldQA Score

DroPE

Updated 4mo ago

Evaluation Results

Method	Links
DroPE 2025.12		32.18	753	24.77	21.49
YaRN 2025.12		27.6	390	17.19	16.23
RoPE-NTK 2025.12		27.58	337	24.65	18.53
DroPE 2025.12		25.9	1,288	39.47	26.08
YaRN 2025.12		23.13	765	26.65	19.14
RoPE-NTK 2025.12		21.81	1,091	32.91	21.88
Base 2025.12		17.26	1,043	32.41	20.03
Base 2025.12		4.12	50	4.7	3.11