| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| GenEval | OneCAT | Score90 | 21 | 18d ago | |
| M2LONGBENCH (test) | DEEP-REPORTER | Anchor Description39.7 | 19 | 4d ago | |
| DPG | X-Omni | DPG Score87.65 | 14 | 18d ago | |
| LT | LongCat-Next | LT-EN Score93.15 | 9 | 18d ago | |
| TIFF | LongCat-Next | TIFF Score (Part 1)82.85 | 7 | 18d ago | |
| Amazon-Baby (test) | GRAPHGPT-O | CLIP-I2 Score0.7477 | 6 | 1mo ago | |
| Amazon-Beauty (test) | GRAPHGPT-O | CLIP-I263.46 | 6 | 1mo ago | |
| ART500K (test) | GRAPHGPT-O | CLIP I2 Score77.62 | 6 | 1mo ago | |
| Social Factors OOD | SoMeLVLM | BLEU-L10.18 | 6 | 1mo ago | |
| Ideology | SoMeLVLM | BLEU-L24.08 | 6 | 1mo ago | |
| Emotion | SoMeLVLM | BLEU-L37.65 | 6 | 1mo ago | |
| Social Factors | SoMeLVLM | BLEU-L14.49 | 6 | 1mo ago | |
| Misinformation | SoMeLVLM | BLEU-L24.06 | 6 | 1mo ago | |
| Hate Speech | SoMeLVLM | BLEU-L31.04 | 6 | 1mo ago |