Needle-in-a-haystack on NIAH 1
[Chart: Success Rate (1k Context) by date. Highlighted result: Avey-B, 79.69, Feb 17, 2026. Selectable metrics: Success Rate at 1k, 2k, 4k, 8k, 16k, 32k, 64k, and 96k context.]
Updated 4d ago
Evaluation Results
Success Rate by context length. A "-" means no result is listed for that context length.

| Method | Scale | Date | 1k | 2k | 4k | 8k | 16k | 32k | 64k | 96k |
|---|---|---|---|---|---|---|---|---|---|---|
| Avey-B | Large | 2026.02 | 79.69 | 79.24 | 79.03 | 79.58 | 79.44 | 78.44 | 76.76 | 76.06 |
| NeoBERT | Medium | 2026.02 | 79.65 | 79.13 | 74.73 | - | - | - | - | - |
| Avey-B | Base | 2026.02 | 79.41 | 79.21 | 78.94 | 79.19 | 78.91 | 77.73 | 77.18 | 75.72 |
| ModernBERT | Large | 2026.02 | 68.8 | 67.52 | 67.2 | - | - | - | - | - |
| ModernBERT | Base | 2026.02 | 67.74 | 67.64 | 68.31 | 70.67 | - | - | - | - |