| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MemoryCD Books | LATTE GRU | ROUGE-L Score27.8 | 35 | 7d ago | |
| BESPOKE | P-CHECK | R-L9.94 | 18 | 3mo ago | |
| OmniPBench | CLIP-I Score0.832 | 12 | 3mo ago | ||
| Roleplay (test) | POPI-Full | Accuracy72.36 | 10 | 1mo ago | |
| Review (test) | POPI-Full | Accuracy95.76 | 10 | 1mo ago | |
| ELIX (test) | POPI-Full | Accuracy80.14 | 10 | 1mo ago | |
| LongLaMP Pair A Writing (test) | SPECSTEER | ROUGE-130.79 | 8 | 2mo ago | |
| LongLaMP (Pair A) - Review (test) | SPECSTEER | ROUGE-133.03 | 8 | 2mo ago | |
| LongLaMP (Pair A) - Abstract (test) | SPECSTEER | ROUGE-141.35 | 8 | 2mo ago | |
| LaMP-7 (test) | ClusterRAG-H | ROUGE-152.1 | 7 | 14d ago | |
| LaMP-5 (test) | ClusterRAG-H | R-1 Score49 | 7 | 14d ago | |
| LaMP-4 (test) | ClusterRAG-H | R-1 Score19 | 7 | 14d ago | |
| PersonaMem 128K memory corpus 1.0 (test) | Recol. | Revisit Reasons81.41 | 5 | 2mo ago | |
| PersonaMem 32K memory corpus 1.0 (test) | Recol. | Revisit Reasons94.95 | 5 | 2mo ago | |
| PersonaMem 1M memory corpus 1.0 (test) | RF-Mem | Revisit Reasons77.87 | 4 | 2mo ago |