| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Pronoun Disambiguation Problem | PDP 2016 (test) | Accuracy78.3 | 21 | |
| Pickup and Delivery Problem | PDP20 uniform | Objective Value4.595 | 9 | |
| Commonsense Reasoning | PDP | Accuracy91.66 | 8 | |
| Coreference Resolution | PDP (test) | Accuracy95 | 7 |