Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context reasoning on LongSeal

64.96Accuracy

GEMINI 3.1 FLASH-LITE

31.794440.404749.01557.6253Apr 6, 2026
Updated 10d ago

Evaluation Results

MethodLinks
64.96
2026.04
64
59.84
2026.04
58.5
2026.04
52.17
42.18
2026.04
40.94
2026.04
38.19
2026.04
34.65
2026.04
33.07