
Long-context language modeling on LongBench V2 (test)

[Chart: Acc (Short) over time, Dec 3, 2025 to Mar 30, 2026; y-axis up to 60; labeled model: Qwen3-32B]
Updated 5d ago

Evaluation Results

Method   Date      Acc (Short)                                                   Links
         2026.03   60           41.1    53.1    46.8    49.2    -       47.2
         2026.03   57           40      51.1    46.2    48.1    -       49.4
         2026.03   46           33      38.6    35.2    36.5    -       27.6
         2026.03   44           35.4    39.1    34.8    36.4    -       25.9
         2025.12   35.39        20.93   34.44   28.74   30.68   6.56    -
         2025.12   33.71        18.6    34.44   25.86   28.79   0       -
         2025.12   32.02        19.78   26.67   28.74   28.03   -2.64   -