
Long-context Classification on OOLONG (test)

TREC-Q-coarse Accuracy: 58.1

RLM + PEEK

[Chart: TREC-Q-coarse Accuracy; date shown: May 19, 2026]
Updated 14d ago

Evaluation Results

Method    Links
2026.05    58.1    69.4    57
2026.05    48.8    61.6    42
2026.05    42      49.5    30
2026.05    36.6    63.1    29
2026.05    32      49.6    23
2026.05    30.3    46.5    23