Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Generative Performance on HealthBench

0.67Pearson r

RUDE

0.63650.653250.670.68675May 12, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
0.670.0010.660.01