| TimeQA Hard 1.0 (test) | Qwen3-14B | EM 82.2 | | 24 | 4d ago |
| TimeQA Easy 1.0 (test) | GPT-4o-mini | EM 93.7 | | 24 | 4d ago |
| TempReason-L3 in-domain (test) | Qwen3-14B | EM 0.851 | | 20 | 4d ago |
| TempReason-L2 in-domain (test) | GPT-4o-mini | EM 80.8 | | 20 | 4d ago |
| LoCoMo | Membox | F1 Score 65.06 | | 17 | 4d ago |
| ICEWS05-15 (test) | GETER | Positive Score 78.94 | | 17 | 4d ago |
| GDELT (test) | GETER | Positive Accuracy 63.77 | | 17 | 4d ago |
| ICEWS14 (test) | GETER | Positive Score 77.45 | | 17 | 4d ago |
| RoboInter-VQA Temporal | RoboInter-Qwen-7B | Visual Trace 81.9 | | 13 | 3d ago |
| Time-Dialog (test) | GPT-4 | Location Accuracy 88.9 | | 13 | 4d ago |
| EgoAVU-Bench | EgoAVU-Instruct (Full) | Accuracy 67.84 | | 9 | 4d ago |
| TempQuestions (test) | QAaP | Exact Match (EM) 60.3 | | 9 | 4d ago |
| LongMemEval S (test) | HIPPOCAMPUS | F1 Score 15.03 | | 7 | 2d ago |
| LongMemEval-M | HIPPOCAMPUS | F1 Score 12.69 | | 7 | 2d ago |
| LoCoMo (test) | FullContext | LLM Score 0.742 | | 7 | 4d ago |
| FineAction CGR | SlowFocus | Accuracy 53.1 | | 6 | 2d ago |
| Time-Dialog Out-of-Domain (test) | MemAgent | F1 Score 40.2 | | 6 | 4d ago |
| BIG-bench Hard Temporal Sequences (test) | PE2 | Test Accuracy 62 | | 4 | 3d ago |
| TimeQuestions (test) | EXAQT | EM 56.8 | | 4 | 4d ago |
| LongMemEval S | - | F1 - | | 0 | 4d ago |
| FineAction CGR (test) | - | Accuracy - | | 0 | 4d ago |