Long-context dialogue evaluation on LoCoMo

69.26 Normalized Score

GLM-5

Updated 4mo ago

Evaluation Results

Method	Links	Normalized Score
GLM-5 2026.03		69.26	0.15
MiniMax-M2.5 2026.03		66.04	0.15
Qwen3-Max-Thinking 2026.03		62.11	0.15
DeepSeek-V3.2 2026.03		59.25	0.15
Kimi-K2.5 2026.03		42.91	0.15