Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Language Model Evaluation on WildBench

26.95WildBench Score

PUGC

25.577225.933626.2926.6464Jun 4, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.06
26.9546.5633.3611.4340.217.16
2025.06
25.6342.0730.0610.0840.118.4