
ODA

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Multi-domain language model evaluation | ODA benchmark suite (test) | General Accuracy: 71.2 | 21
Multi-domain language model evaluation | ODA benchmark suite 1.0 (full) | General Score: - | 0