Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WQ

Benchmarks

Task NameDataset NameSOTA ResultTrend
Open-Domain Question AnsweringWQ (test)
EM33.71
37
Reward ModelingWQ Arena
Accuracy65.29
22
Open-domain retrievalWQ
Recall@2073.2
9
Question AnsweringWQ
Accuracy45.5
8
Showing 4 of 4 rows