WikiBigEdit

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Model Editing | WikiBigEdit | MMLU: 69.5 | 34
Model Editing | WikiBigEdit 3,000 samples (test) | Reliability: 99.9 | 13