General QA on GPQA Diamond

69.19 Exact Match

DeepSeek-R1

Dec 8, 2025
Updated 4d ago

Evaluation Results

Method Links
69.19
68.69
44.44