Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LD

Benchmarks

Task NameDataset NameSOTA ResultTrend
MCQ ClassificationLD 3 v1 (Eva)
Accuracy100
6
MCQ ClassificationLD 3 v1 (infer)
Accuracy100
6
Logical ReasoningLD (val)
Accuracy78.33
5
Showing 3 of 3 rows