Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Episodic Reasoning on Test of Time 2,800
Loading...
93.1
EM
REMem-I
65.852
72.926
80
87.074
Feb 13, 2026
EM
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
REMem-I
mode=iterative
2026.02
93.1
REMem-I
mode=iterative, TISER=...
2026.02
90.6
Full-Context
mode=Full context window
2026.02
79.7
REMem-S
mode=single-step
2026.02
72.5
Qwen3-Embed-8B
Category=Large Embeddi...
2026.02
70.3
NV-Embed-v2
Parameters=7B, Categor...
2026.02
68.9
NV-Embed-v2
Parameters=7B, TISER=t...
2026.02
68.9
HippoRAG 2
Category=Structure-Aug...
2026.02
66.9
Feedback
Search any
task
Search any
task