Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Role-playing Agent Evaluation on PersonaGym

4.13Action Justification

GPT-4.1

3.353.55253.7553.9575May 16, 2026
Updated 15d ago

Evaluation Results

MethodLinks
2026.05
4.134.1344.254.884.28
2026.05
3.883.633.754.254.924.09
2026.05
3.53.633.53.884.933.88
2026.05
3.383.133.133.754.913.66