Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Alignment Evaluation on Open-ended questions

68.9Win Rate

Single-Agent

37.1845.41553.6561.885Mar 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
68.9
2026.03
68.7
2026.03
63.4
2026.03
51.8
2026.03
40.4
2026.03
38.4