| Dataset Name | SOTA Method | Metric | Trend | Updated | |
|---|---|---|---|---|---|
| C-Eval | Kimi-K2 | Accuracy 92.5 | 47 | 3d ago | |
| CMMLU (test) | FBS-Full | CMMLU Score 0.574 | 13 | 3d ago | |
| CMMLU | Kimi-K2 | Score 90.9 | 10 | 3d ago | |
| MMMLU | IRR | MMMLU Score 37.08 | 8 | 3d ago | |
| C-Eval (test) | | Accuracy 86 | 7 | 3d ago | |
| C-SimpleQA | Kimi-K2 | Score 77.6 | 6 | 3d ago | |
| C-Eval | | Exact Match 91.8 | 6 | 3d ago | |
| CLUEWSC | DeepSeek-R1 | Exact Match (EM) 92.8 | 5 | 3d ago | |