| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multimodal Conversation | PCogAlignBench (LS2) | LLM Judge Score0.852 | 20 | |
| Multimodal Conversation | PCogAlignBench LS1 | LLM Judge Score0.903 | 20 | |
| Personalized VLM Alignment | PCogAlignBench LS1->LS1 1.0 | P Score4.303 | 16 | |
| Personalized response selection | PCogAlignBench Average | P Score4.154 | 14 | |
| Personalized response selection | PCogAlignBench LS2->LS2 | P Score4.151 | 14 | |
| Personalized response selection | PCogAlignBench LS2->LS1 | P Score4.15 | 14 | |
| Personalized response selection | PCogAlignBench LS1->LS2 | P. Score4.156 | 14 | |
| Personalized response selection | PCogAlignBench LS1->LS1 | P Score4.161 | 14 | |
| Personalized VLM Alignment | PCogAlignBench LS1->LS2 1.0 | P Score4.321 | 8 | |
| Personalized VLM Alignment | PCogAlignBench 1.0 | P Score4.312 | 4 | |
| Personalized VLM Alignment | PCogAlignBench LS2->LS1 1.0 | P Score4.275 | 4 | |
| Personalized VLM Alignment | PCogAlignBench LS2->LS2 1.0 | P Score4.321 | 2 |