Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Ability Evaluation on In-House General Benchmark

68.82Knowledge & Comprehension

GPT-5

55.986459.318262.6565.9818Dec 8, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
68.8275.9580.9485.0886.179.39
2025.12
65.8774.6378.3182.0686.2577.42
62.774.5569.9180.1688.5775.19
2025.12
62.0777.7574.5476.8388.8676.01
2025.12
61.3973.5174.5479.8780.1973.9
2025.12
56.4862.3462.7367.9472.9564.49