Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OpenAI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-based safety moderationOpenAI
F1 Score82.3
26
Question AnsweringOpenAI (in-domain)
Accuracy0.8956
12
Diverse Nearest Neighbor SearchOpenAI dataset
Search Cost0.331
4
Showing 3 of 3 rows