Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agentic Task Performance on Agent Task Benchmark 240 documents 1.0 (Evaluation set)

92.3Information Lookup Success Rate

OBJECTGRAPH(E)

87.20488.52789.8591.173Apr 30, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
92.390.186.295.180.391.696.594.190.8
2026.04
92.189.485.794.877.991.496.393.290.1
2026.04
91.288.684.376.482.161.352.871.476
2026.04
87.483.179.871.274.654.748.169.371