| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Combined | Multilingual Benchmark | IA Score | 8.83 | 34 |
| Reasoning | Multilingual Benchmark | IA Score | 6.04 | 17 |
| Style | Multilingual Benchmark | IA | 8.82 | 17 |
| Translate | Multilingual Benchmark | IA | 6.34 | 17 |
| Rearrange | Multilingual Benchmark | IA Score | 7.37 | 17 |
| Delete | Multilingual Benchmark | IA | 8.6 | 17 |
| Replace | Multilingual Benchmark | IA | 8.39 | 17 |
| Add | Multilingual Benchmark | IA | 9.59 | 17 |
| Multilingual NLP | Multilingual Benchmark Average across languages (test) | Average Score | 89.15 | 8 |
| Text Rendering | Multilingual Benchmark English (test) | Character Precision | 99.68 | 7 |
| Text Rendering | Multilingual Benchmark Chinese (test) | Character Precision (Chars Pre) | 93.44 | 5 |