Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AGI Eval

Benchmarks

Task NameDataset NameSOTA ResultTrend
General ReasoningAGI Eval English
Score90.1
32
ReasoningAGI Eval EN
Accuracy89.4
15
General Intelligence EvaluationAGI Eval English
Score92.2
8
Text-to-Image GenerationAGI-Eval text-to-image arena 6
ELO Score0.4859
6
General Intelligence EvaluationAGI-Eval
Accuracy44.2
2
Showing 5 of 5 rows