Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Failure attribution on Who & When Baseline

54.33Agent Attribution Accuracy

All-at-Once

27.674834.594941.51548.4351Mar 18, 2026
Updated 2mo ago

Evaluation Results

MethodLinks
2026.03
54.3312.5
2026.03
44.1323.98
2026.03
41.469.76
2026.03
35.225.51
2026.03
34.131.59
2026.03
28.712.04