Share your thoughts, 1 month free Claude Pro on usSee more

Instruction Following with Long-term Memory on Human Evaluation 1-10 scale (test)

8.7Coherence

EventWeave

Updated 3mo ago

Evaluation Results

Method	Links
EventWeave 2025.03		8.7	8.5	8.4	8.6	8.6
LifeLongMem 2025.03		8	7.9	8.2	7.9	8
MemWalker 2025.03		7.8	7.6	8.1	7.7	7.8
LongMem 2025.03		7.7	7.5	8	7.6	7.7
ProactiveCoT 2025.03		7.5	7.2	8	7.4	7.5
GPT-4o 2025.03		6.8	6.5	7.9	6.7	7