Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Judge Evaluation on LLM-to-LLM Evaluation Reference: GPT-5.2

0.84Global Correlation (r)

GPT-5-mini

0.4760.57050.6650.7595Mar 12, 2026
Updated 2mo ago

Evaluation Results

MethodLinks
2026.03
0.840.543552
2026.03
0.490.29422