Needle-in-a-haystack retrieval on NIAH 64K 60 items, 3 needle positions (test)
Top result: Full cache, F1 Score 28.2 (May 18, 2026)
Metrics: F1 Score, Early Retrieval, Middle Retrieval, Late Retrieval, Ceiling Success
Updated 14d ago
Evaluation Results

| Method | Configuration | Date | F1 Score | Early Retrieval | Middle Retrieval | Late Retrieval | Ceiling Success |
|---|---|---|---|---|---|---|---|
| Full cache | C=65536, Ret%=100%, mo... | 2026.05 | 28.2 | 29.3 | 27.5 | 27.8 | 100 |
| LRU + prot | C=4096, Ret%=6.3%, mod... | 2026.05 | 11.2 | 29.7 | 2.4 | 1.5 | 39.8 |
| LRU (no prot) | C=4096, Ret%=6.3%, mod... | 2026.05 | 9.8 | 28.6 | 0 | 0.7 | 34.7 |
| Random + prot | C=8192, Ret%=12.5%, mo... | 2026.05 | 4.2 | 2.8 | 5.8 | 4 | 14.9 |
| H2O + prot | C=8192, Ret%=12.5%, mo... | 2026.05 | 3.5 | 5.6 | 3.5 | 1.4 | 12.4 |
| Random + prot | C=4096, Ret%=6.3%, mod... | 2026.05 | 3 | 0.7 | 5.1 | 3.4 | 10.6 |
| H2O + prot | C=4096, Ret%=6.3%, mod... | 2026.05 | 2.2 | 3.4 | 2.4 | 0.9 | 7.8 |
| H2O (no prot) | C=4096, Ret%=6.3%, mod... | 2026.05 | 0 | 0 | 0 | 0 | 0 |
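
The Ret% values in the configurations look consistent with the cache size C divided by the context length. The sketch below checks that arithmetic. It assumes that "64K" in the benchmark name means 65,536 tokens and that Ret% is a plain ratio; the page does not state either, and the name `retention_percent` is hypothetical.

```python
# Assumption: "64K" in the benchmark name means 65,536 tokens of context.
CONTEXT_TOKENS = 64 * 1024


def retention_percent(cache_size: int, context_tokens: int = CONTEXT_TOKENS) -> float:
    """Return the share of the context that fits in the cache, in percent.

    Assumption: Ret% is cache size over context length, capped at 100%.
    """
    if cache_size < 0 or context_tokens <= 0:
        raise ValueError("cache_size must be >= 0 and context_tokens must be > 0")
    return 100.0 * min(cache_size, context_tokens) / context_tokens


# Cache sizes that appear in the results above.
for c in (65536, 8192, 4096):
    print(f"C={c}: Ret%={retention_percent(c):.2f}")
```

Under these assumptions C=4096 gives 6.25%, which matches the listed 6.3% after rounding to one decimal place, and C=8192 gives 12.5%.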