Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context language understanding on LongBench 15 English

18.26NarrativeQA

Baseline

14.536815.503416.4717.4366Mar 26, 2026
Updated 22d ago

Evaluation Results

MethodLinks
2026.03
18.2612.0125.9613.767.8714.9532.7921.4326.9551.934787.7644.727037.534.19
2026.03
15.5611.7723.7114.398.4114.923221.1927.0351.535.9784.142.6268.537.0832.58
2026.03
15.4611.8223.6814.397.7714.5332.3221.226.8650.4635.5984.342.6768.537.1732.45
2026.03
14.6810.824.9514.218.3914.232.0122.0126.4347.537.5185.5442.056936.3632.38