| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Language Modeling | CodeParrot 16K | Perplexity 2.25 | 8 |
| Language Modeling | CodeParrot 4K | Perplexity 2.33 | 8 |
| Language Modeling | CodeParrot 100K | Perplexity 2.36 | 6 |
| Language Modeling | CodeParrot 32K | Perplexity 2.23 | 6 |
| Language Modeling | CODEPARROT | Perplexity 3.36 | 4 |
| Token-level Reconstruction | codeparrot-clean In-Domain (val) | Reconstruction Accuracy (TRA) 99.47 | 1 |