Needle-in-a-Haystack Retrieval on BABILong 32K context length
[Figure: Accuracy chart. Highlighted entry: MemDLM (Train & Inference), Accuracy 9, Mar 23, 2026.]
Evaluation Results
Method | Links | Accuracy
MemDLM (Train & Inference); Backbone=LLaDA-MoE, Se... | 2026.03 | 9
MemDLM (Train-Only); Backbone=LLaDA-MoE, Se... | 2026.03 | 8.5
Standard MDLM; Backbone=LLaDA-MoE | 2026.03 | 6.8