Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NIAH

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-needle retrievalNIAH (M)
Accuracy (NIAH M)90.2
35
Long-context retrievalNIAH multivalue
Speedup4.1
20
Long Context RetrievalNIAH-Multi
Accuracy100
13
Long-context retrievalNIAH (avg)
Score (4k Context)100
7
Long ContextNIAH
Accuracy99.8
6
Long-context retrievalNIAH 32k
NIAH Score99
6
Long-context retrievalNIAH 16k
NIAH Score98.6
6
Needle-in-a-haystackNIAH Needle-in-a-haystack
NIAH Success Rate (32K Context)100
6
Needle-in-a-haystackNIAH 1
Success Rate (1k Context)79.69
5
Needle-in-a-haystackNIAH-2 (test)
NIAH-2 Success Rate (1k)79.61
5
Long-context recallNIAH Single-3
Recall @ 32K Context100
4
Long-context recallNIAH Single 2
Recall @ 32K Context1
4
Long-context recallNIAH Single-1
Recall @ 32K100
4
Showing 13 of 13 rows