Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NQ-Open

Benchmarks

Task NameDataset NameSOTA ResultTrend
Question AnsweringNQ-Open (val)
Accuracy30.7
28
Hallucination detectionNQ-Open
AUROC0.8843
27
Factual Question AnsweringNQ-Open ID
Precision57.34
24
Question AnsweringNQ-open v1.0 (test)
A179.08
16
Question AnsweringNQ-Open (out-of-domain)
Precision0.705
12
Showing 5 of 5 rows