Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ODA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-domain language model evaluationODA benchmark suite (test)
General Accuracy71.2
21
Multi-domain language model evaluationODA benchmark suite 1.0 (full)
General Score-
0
Showing 2 of 2 rows