| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Behavior Prediction | GROVE TechCrunch 1.0 | Average Performance Score85.7 | 5 | |
| Behavior Prediction | GROVE Full Wikipedia domains 1.0 | Accuracy (Aero.)70.7 | 5 | |
| Behavior Prediction | GROVE Wikipedia 1.0 (P2 & P3) | Accuracy (Aero.)76 | 3 |