Long-context Evaluation on RULER (32k context, average over 13 tasks)

Score: 0.635 (Vanilla)

Updated 19d ago

Evaluation Results

Method	Links	Score
Vanilla 2026.05		0.635	-	-
Self-Pruned KV 2026.05		0.61	-3.9	17.5
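
The fourth value in the Self-Pruned KV row (-3.9) has no column label in the table. It is numerically consistent with the relative change in score against the Vanilla baseline, expressed in percent. That reading is an assumption, not something the table states, and the last value (17.5) is left uninterpreted here. The helper name `relative_change_pct` is hypothetical. A minimal sketch of the arithmetic:

```python
def relative_change_pct(score: float, baseline: float) -> float:
    """Relative change of `score` against `baseline`, in percent."""
    return (score - baseline) / baseline * 100.0


# Scores taken from the table above.
vanilla = 0.635
self_pruned_kv = 0.61

# Assumed interpretation: the unlabeled -3.9 is this relative change.
delta = relative_change_pct(self_pruned_kv, vanilla)
print(round(delta, 1))  # -3.9
```

Under that assumption, Self-Pruned KV loses about 3.9% of the Vanilla score, which is 0.025 points in absolute terms.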