| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Classification | penguins | Improvement in Balanced Accuracy | 7.8 | 8 |
| Data Synthesis | penguins | MMD | 0.077 | 8 |
| Multiclass Classification | penguins | AUC | 100 | 7 |
| Object Counting | Penguins (Mixed) | Max Count | 9.81 | 6 |
| Object Counting | Penguins Separated | Max Count | 3.74 | 6 |
| Logical Reasoning | Penguins | Accuracy | 72.734 | 4 |
| Symbolic Reasoning | Penguins | Solve Rate | 93.3 | 4 |