| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| MT Math100 | ROSA2 | Accuracy | 93.6 | 64 | 1mo ago |
| MGSM (test) | QuaSAR | Accuracy | 93.4 | 57 | 1mo ago |
| MGSM | DeepSeek V3.2 | Accuracy | 94.9 | 52 | 8d ago |
| MGSM 1.0 (test) | MindMerger-Hard | Accuracy (ru) | 69.6 | 35 | 1mo ago |
| MSVAMP | Task Arithmetic | Accuracy (English) | 75 | 33 | 1mo ago |
| MGSM | FLy | Speedup (de) | 6.08 | 20 | 1mo ago |
| MGSM8KInstruct (test) | | Accuracy (En) | 47.6 | 7 | 1mo ago |
| MCLM 1.0 (test) | | MT-MATH100 Score | 84.89 | 7 | 1mo ago |
| MGSM 18 languages | SP3F-7B | Accuracy | 72.5 | 6 | 1mo ago |