Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-Context Language Modeling on LongBench 1.0 (test)

32.6NrtvQA

FULL

17.93621.74325.5529.357Apr 12, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.04
32.645.550.459.65640.133.223.925.27270.837.11810012.418.443.4
2026.04
32.244.151.960.255.138.932.424.324.771.570.737.416.510011.81843.1
2026.04
32.144.853.361.654.638.53123.624.87270.336.215.510010.717.342.6
2026.04
29.543.950.961.455.838.830.823.824.3717035.7159710.517.342.2
2026.04
20.829.443.13324.414.730.822.826.766.58422.5030.554.759.235.2
2026.04
20.429.54334.623.714.229.8-26.166.584.722.9028.553.65935
2026.04
19.927.740.933.924.214.428.621.825.966.584.222.31.527.352.657.734.3
2026.04
18.526.940.634.222.71429.320.925.666.584.321.91.526.152.556.833.9