Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Episodic Reasoning on Complex-TR
Loading...
90.6
F1 Score
REMem-I
41.2
54.025
66.85
79.675
Feb 13, 2026
F1 Score
BLEU-1 Score
LLM-J Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
BLEU-1 Score
LLM-J Score
REMem-I
mode=iterative, TISER=...
2026.02
90.6
86
92
NV-Embed-v2
Parameters=7B, TISER=t...
2026.02
88.1
83.6
88.3
REMem-I
mode=iterative
2026.02
83.3
77.6
89.6
REMem-S
mode=single-step
2026.02
78.5
72.7
82.6
HippoRAG 2
Category=Structure-Aug...
2026.02
78.2
72.7
81.5
NV-Embed-v2
Parameters=7B, Categor...
2026.02
77.5
71.9
80.4
Qwen3-Embed-8B
Category=Large Embeddi...
2026.02
77.1
71.4
80.9
Graphiti
Category=Structure-Aug...
2026.02
76.6
71.4
78.8
Full-Context
mode=Full context window
2026.02
74.2
68
81.6
Mem0
Category=Structure-Aug...
2026.02
43.1
35.1
41
Feedback
Search any
task
Search any
task