WorkDL

Human Evaluation on 50 randomly selected model responses

Clarity: 98

GPT-4.1

Nov 28, 2025
Updated 1mo ago

Evaluation Results

Method    Links
2025.11
98    96    98
84    82    82
70    74    64