| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Machine Translation | general 2023 (test) | BLEU32.64 | 16 | |
| Image Manipulation Detection | General Inference Speed Evaluation Images | FPS31.7 | 16 | |
| Instance Erasure | General | FID (General)13.24 | 13 | |
| Stability | General (MMLU, BBH, TyDiQA, BoolQ, PIQA, GSM8K) | General Score55.75 | 9 | |
| Video Compression | General | Parameters (M)18.34 | 9 | |
| Segmentation | General Efficiency Evaluation | Latency (ms)7.3 | 9 | |
| Underwater Image Enhancement | General Architectural Comparison 1.0 (UEIB-T90) | PSNR22.82 | 8 | |
| General Vision-Language Understanding | General | Avg Score72.4 | 8 | |
| Average evaluation across 7 tasks | General (test) | BERTScore76.5 | 8 | |
| Colon Polyp Segmentation | General | Parameters (M)32.55 | 8 | |
| 360-degree video saliency prediction | General | Params (M)3.7 | 7 | |
| Model Efficiency Analysis | General 16 frames, 512 text tokens (inference) | FPS20.74 | 6 | |
| Interactive Segmentation | General Efficiency Benchmarking | Parameters (MB)84.89 | 6 | |
| Novel View Synthesis | General | MFLOPs / Pixel13.77 | 5 | |
| Ending event prediction | General (test) | MRR0.401 | 5 | |
| Vulnerability Analysis | General | Metric- | 0 | |
| 3D Scene Decomposition Capability Assessment | General Method Capability Comparison | Metric- | 0 | |
| Conditional Layout Generation | General Literature Comparison | Metric- | 0 |