Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context Reasoning on Long-context Reasoning Suite (test)

74.91Average Score

Qwen3.5-35B-A3B

52.123658.039363.95569.8707May 19, 2026
Updated 14d ago

Evaluation Results

MethodLinks
2026.05
74.9168.8759.4474.8897.3778.7270.2
2026.05
74.7467.6262.8284.5977.2981.5673.7
2026.05
72.462.3865.7274.5179.9280.6271.28
2026.05
71.266.455.274.582.580.967.7
2026.05
69.865.355.174.581.673.668.7
2026.05
69.4364.1257.8977.9371.2474.6970.71
2026.05
68.8165.8745.9270.8796.0970.5263.6
2026.05
68.7364.7556.7765.7878.8479.3866.86
2026.05
68.6763.4459.4876.8664.8877.569.9
2026.05
68.4565.7557.4675.1266.1775.3170.9
2026.05
67.265.155.371.466.976.967.9
2026.05
62.262.545.566.667.565.165.9
2026.05
60.163.348.770.241.670.566.5
2026.05
59.462.547.967.447.964.765.8
2026.05
58.5561.2547.0172.6939.6864.3866.3
2026.05
57.0663.8843.7473.5443.8850.3167.1
2026.05
56.161.344.367.140.958.864.1
2026.05
536140.264.438.449.964