| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| C-Eval | Kimi-K2 | Accuracy92.5 | 56 | 18d ago | |
| CMMLU (test) | FBS-Full | CMMLU Score0.574 | 13 | 1mo ago | |
| CMMLU | Kimi-K2 | Score90.9 | 10 | 1mo ago | |
| MMMLU | IRR | MMMLU Score37.08 | 8 | 1mo ago | |
| C-Eval (test) | Accuracy86 | 7 | 1mo ago | ||
| C-SimpleQA | Kimi-K2 | Score77.6 | 6 | 1mo ago | |
| C-Eval | Exact Match91.8 | 6 | 1mo ago | ||
| CLUEWSC | DeepSeek-R1 | Exact Match (EM)92.8 | 5 | 1mo ago |