Long-context language understanding on LongBench (Specific Metric Subset)

87.77 TriviaQA Score

MemDLM (Train & Inference)

Updated 4mo ago

Evaluation Results

Method	Date	Links	TriviaQA	[11 further metric columns, labels not given]
MemDLM (Train & Inference)	2026.03		87.77	54.69	87.38	31.97	43.28	22.61	22.34	26.7	71	16.14	64.25	55.23
MemDLM (Train-Only)	2026.03		87.74	54.36	86.29	31.44	42.87	22.43	22.19	26.35	70.5	15.8	64.2	55.05
Standard MDLM	2026.03		55.29	50.32	74.5	29.22	42.47	21.85	22.13	23.52	70	13.59	62.55	54.65