Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context language understanding on LongBench (Qasper, HotpotQA, LCC, and 9 other metrics subset)

30.19Qasper

Standard

18.198821.311924.42527.5381Oct 17, 2024
Updated 14d ago

Evaluation Results

MethodLinks
2024.10
30.1938.4710.1711.9224.776886.3538.091.6748.3340.8936.26
2024.10
26.9237.239.1212.7824.6365.585.3537.743.148.6838.3335.43
2024.10
23.4829.588.5611.3520.9464.584.838.223.344.936.6233.3
2024.10
18.6624.145.577.6420.096670.4424.872.0433.3822.0926.81