| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Influence Estimation | Benchmarks Budgets k=1, 5, 10, 25 (Aggregated) | AUC (SR, dB)42.73 | 66 | |
| General Multimodal Understanding | Combined 9 Benchmarks | Average Accuracy100 | 13 | |
| Zero-shot language understanding | Zero-shot Benchmarks | Average Zero-shot Accuracy51.47 | 9 | |
| General Language Understanding | 10 Benchmarks Average (test) | Accuracy (Average)63.7 | 6 |