Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Needle-in-a-Haystack Retrieval on RULER 16K context length
Loading...
29.4
RULER-MV Score
MemDLM (Train & Inference)
22.068
23.9715
25.875
27.7785
Mar 23, 2026
RULER-MV Score
RULER-VT Score
RULER-CWE Score
Updated 25d ago
Evaluation Results
Method
Method
Links
RULER-MV Score
RULER-VT Score
RULER-CWE Score
MemDLM (Train & Inference)
Backbone=LLaDA-MoE, Se...
2026.03
29.4
56.84
57.24
MemDLM (Train-Only)
Backbone=LLaDA-MoE, Se...
2026.03
25.48
55.3
54.25
Standard MDLM
Backbone=LLaDA-MoE
2026.03
22.35
52.56
44.2
Feedback
Search any
task
Search any
task