Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA General Instruction Following benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
General Instruction Following
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Arena-Hard
TAG-INSTRUCT
Score
22.1
35
4d ago
Arena-Hard v2
o3
Score
85.9
23
4d ago
WildBench
GenRM-R-Align-14B
Score
92.6
19
4d ago
MMLU style (test)
Student (Self-Distillation)
Accuracy
51.06
3
4d ago
Showing 4 of 4 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task