
Agentic Reasoning on TIR-Bench

Accuracy: 19.8

PyVision-Image

[Chart: Accuracy, axis ticks 15.848, 16.874, 17.9, 18.926; Feb 24, 2026]
Updated 1mo ago

Evaluation Results

Date       Accuracy
2026.02    19.8
2026.02    17.3
2026.02    16