| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| M2D2 (test) | Coding expert | Accuracy (Coding)100 | 11 | 2mo ago | |
| WikiText-103 GPT-2 (124M) (train) | GEM (N = 2) | Train Loss4.4614 | 6 | 1mo ago | |
| WikiText-103 GPT-2 (124M) (val) | GEM (N = 2) | Validation Perplexity72.57 | 6 | 1mo ago | |
| CLM Eval | Memory Model | Hr0.684 | 6 | 3mo ago |