
Knowledge Benchmarks

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
General Knowledge | Knowledge Benchmarks (ARC-C, ARC-E, MMLU, GPQA) (test) | ARC-C: 83.05 | 18
Knowledge-based Visual Question Answering | Knowledge Benchmarks | Average Score: 48.2 | 12