
Long-context Question Answering on LongMemEval-S cross-benchmark replication Full-500

50.4 Jaccard Score (S-4.5, 3x)

zenbrain

[Chart: score axis ticks 36.464, 40.082, 43.7, 47.318; date axis tick Apr 26, 2026]
Updated 1mo ago

Evaluation Results

Method  Links
2026.04    50.4    57.5    55.5
2026.04    45      51.3    49.2
2026.04    38.9    43.6    41.6
2026.04    37      41.4    39.8