| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| LRA ListOps Length 2000 Arguments 10 | OM | Accuracy80.1 | 11 | 4d ago | |
| ListOps-O Argument Generalization (Arguments 15) | EBT-GRC | Accuracy79 | 11 | 4d ago | |
| ListOps-O Argument Generalization (Arguments 10) | OM | Accuracy0.8415 | 11 | 4d ago | |
| ListOps-O Length Generalization (Lengths 900-1000) | EBT-GRC | Accuracy99.5 | 11 | 4d ago | |
| ListOps-O Length Generalization (Lengths 500-600) | EBT-GRC | Accuracy99.4 | 11 | 4d ago | |
| ListOps-O Length Generalization (Lengths 200-300) | EBT-GRC | Accuracy99.9 | 11 | 4d ago | |
| ListOps-O near-IID (Lengths < 1000, Arguments < 5) | OM | Accuracy99.9 | 11 | 4d ago |