
LD

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Classification | LD UCI Repository (test) | Accuracy 68.1 | 6 |
| MCQ Classification | LD 3 v1 (Eva) | Accuracy 100 | 6 |
| MCQ Classification | LD 3 v1 (infer) | Accuracy 100 | 6 |
| Logical Reasoning | LD (val) | Accuracy 78.33 | 5 |