Dialogue Memory on LongMemEval
[Chart: Score by date. Top entry: GoLongRL (w. GRPO), Score 75.2, May 19, 2026. Updated 14d ago.]
Evaluation Results
Method                            Model                Date     Score
GoLongRL (w. GRPO)                Qwen3-30B-A3B-Th...  2026.05  75.2
QwenLong L1.5-30B                 QwenLong L1.5-30B    2026.05  72.2
Qwen3-30B-A3B-Thinking-2507 Base  Qwen3-30B-A3B-Th...  2026.05  61.6
GoLongRL (w. TMN-Reweight)        Qwen3-4B-Thinkin...  2026.05  61.2
Qwen3-4B-Thinking-2507 Base       Qwen3-4B-Thinkin...  2026.05  47.6