| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Downstream Performance Evaluation | CORE | CORE Score19.94 | 53 | |
| Domain-Incremental Learning | CORe50 | Avg Accuracy (A)99.5 | 49 | |
| Few-Shot Class-Incremental Learning | CORe50 | BCR99.9 | 39 | |
| Cross-modal geo-localization | CORE Intercontinental-level Subset4 1.0 | R@151.35 | 15 | |
| Cross-modal geo-localization | CORE Intercontinental-level Subset3 1.0 | R@149.81 | 15 | |
| Cross-modal geo-localization | CORE Intercontinental-level Subset2 1.0 | R@164.74 | 15 | |
| Cross-modal geo-localization | CORE Intercontinental-level Subset1 1.0 | R@157.9 | 15 | |
| Cross-modal geo-localization | CORE World-level 1.0 (All) | R@155.84 | 15 | |
| Reasoning | CORE-Ext | Accuracy15.9 | 10 | |
| Reasoning | CORE | Accuracy26.83 | 10 | |
| Language Modeling | Core-Extended | Score17.08 | 8 | |
| Language Modeling | Core | Score28.44 | 8 | |
| Relation Classification | CORE | F1-Mic80 | 8 | |
| General Language Understanding | CORE | CORE Score26.32 | 4 | |
| Comprehensive Optimization and Reasoning Evaluation | CORE | CORE Score25.14 | 4 | |
| Control-Dependency / Trace extraction | CoRe Lite Control-Dependency Trace subtask n=489 | F1 Score94.58 | 3 | |
| Latent centroid displacement prediction | Core Level 5 approximation | Observed Displacement6.39 | 3 |