Human Evaluation on 50 randomly selected model responses

Clarity: 98

GPT-4.1

Updated 4mo ago

Evaluation Results

Method	Links
GPT-4.1 2025.11		98	96	98
Gemma-3-4B-it 2025.11		84	82	82
Phi-3-mini-4k-instruct 2025.11		70	74	64