
Penguins

Benchmarks

Task Name                  Dataset Name        SOTA Result                            Trend
Classification             penguins            Improvement in Balanced Accuracy: 7.8  8
Data Synthesis             penguins            MMD: 0.077                             8
Multiclass Classification  penguins            AUC: 100                               7
Object Counting            Penguins (Mixed)    Max Count: 9.81                        6
Object Counting            Penguins Separated  Max Count: 3.74                        6
Logical Reasoning          Penguins            Accuracy: 72.734                       4
Symbolic Reasoning         Penguins            Solve Rate: 93.3                       4