| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Visual Mathematical Reasoning | MathVerse | Accuracy | 69.2 | 73 |
| Mathematical Reasoning | MathVerse mini | Accuracy | 82.6 | 50 |
| Mathematical Reasoning | MathVerse | Accuracy | 81.45 | 39 |
| Multimodal Mathematical Reasoning | MathVerse (test) | Accuracy (ALL) | 64.9 | 33 |
| Mathematical Multimodal Reasoning | MathVerse | Accuracy | 57.6 | 29 |
| Mathematical problem solving | MathVerse (testmini) | Accuracy | 64.9 | 28 |
| Multimodal Mathematical Reasoning | MathVerse mini (test) | T-Dominant Score | 70.94 | 26 |
| Multimodal Reasoning | MathVerse MINI | Accuracy | 77.7 | 25 |
| Multimodal Reasoning | MathVerse | Accuracy | 55.8 | 20 |
| Multimodal Autoformalization | MATHVERSE Solid Geometry | Compilation Success | 80 | 20 |
| Step-wise Verification | MathVerse VO | Macro F1 | 62.8 | 18 |
| Multimodal Reasoning | MathVerse (testmini) | Mean@1 Accuracy | 57.6 | 17 |
| Multimodal Mathematical Reasoning | MathVerse-V | Accuracy | 81.2 | 17 |
| Visual Mathematical Reasoning | MathVerse mini Vision Only | Avg@3 Score | 48.6 | 14 |
| Mathematical Reasoning | MathVerse Vision Only | Accuracy | 53.4 | 14 |
| Mathematical Reasoning | MathVerse-Plus Vision Only 1.0 (test) | Accuracy | 31 | 12 |
| Mathematical Reasoning | MathVerse-Plus Vision Dominant 1.0 (test) | Accuracy | 41 | 12 |
| Mathematical Reasoning | MathVerse-Plus Text Dominant 1.0 (test) | Accuracy | 53 | 12 |
| Mathematical Reasoning | MathVerse-Plus All 1.0 (test) | Accuracy | 43.6 | 12 |
| Multimodal Mathematical Reasoning | MathVerse mini-vision | Score | 31.2 | 12 |
| Multimodal mathematical reasoning | MathVerse (vision) | Pass@1 Accuracy | 76.39 | 11 |
| General Perception and Reasoning | MathVerse VO | Score | 52 | 11 |
| Multimodal Autoformalization | MATHVERSE Function | Compile Rate | 100 | 10 |
| Multimodal Autoformalization | MATHVERSE Plane Geometry | Compile Rate | 76 | 10 |
| Mathematical Visual Question Answering | MathVerse | Accuracy | 26.4 | 8 |