| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Text Classification | Beer | Accuracy88.5 | 7 | |
| Human forward simulatability | Beer (test) | Accuracy98.3 | 5 | |
| Sentiment Classification | Beer (test) | BVE (Avg Acc Diff)0 | 4 | |
| Concept Comprehensibility Evaluation | Beer | Semantics85 | 4 | |
| Concept Intruder Detection | Beer | Accuracy75 | 4 | |
| Entity Matching | Beer | F1 Score100 | 4 | |
| Entity Matching | Beer (test) | F1 Score100 | 4 |