Share your thoughts, 1 month free Claude Pro on usSee more

Long-context understanding on RULER 32K

94.48Accuracy

Qwen2.5-14B

Updated 2mo ago

Evaluation Results

Method	Links
Qwen2.5-14B 2025.11		94.48
RetroInfer 2025.11		94.41
LiteCache + HATA 2025.11		94.22
HATA Algo 2025.11		94.13
Dense 2026.05		93.01
CompactAttention-FP 2026.05		92.41
Dense Attention 2025.12		92.33
Dense Attention 2025.12		92.33
XAttention 2026.05		92.29
InfiniGen 2025.11		92.19
BLASST 2025.12		92.11
BLASST 2025.12		92.08
BLASST 2025.12		92.07
Dense Attention 2025.12		91.9
Dense Attention 2025.12		91.9
Dense Attention 2025.12		91.9
BLASST 2025.12		91.81
BLASST 2025.12		91.79
BLASST 2025.12		91.74
BLASST 2025.12		91.67
BLASST 2025.12		91.67
BLASST 2025.12		91.55
FlashPrefill 2026.05		91.55
SeerAttention 2026.05		90.23
XAttention 2026.05		89.16
FlashPrefill 2026.05		88.99
CompactAttention-SA 2026.05		88.92
CompactAttention-FP 2026.05		88.77
Dense 2026.05		88.23
RocketKV 2025.12		87.89
QUOKA 2026.05		83.52
QUOKA 2026.05		81.96
HATA Algo 2025.11		80.94
Llama3-8B 2025.11		80.8
LiteCache + HATA 2025.11		80.7
RetroInfer 2025.11		80.64
InfiniGen 2025.11		76.76
Quest 2025.12		56.23