Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hallucination Suppression on HealthBench Hallu

2.37Refuted Rate

GPT-5.2-High

2.23563.14284.054.9572Feb 6, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2.372.78
2026.02
2.452.07
2026.02
4.683.64
2026.02
5.735.43