Our new X account is live! Follow @wizwand_team for updates
SOTA Out-of-domain Generalization benchmarks and papers with code | Wizwand
Out-of-domain Generalization
Benchmarks
| Dataset Name | SOTA Method | Metric | Results | Last Updated |
| --- | --- | --- | --- | --- |
| Diplomat, Mutual, Quality, CoQA, and Qasper Out-of-Domain Average (test) | AutoMix | Score: 70.9 | 9 | 4d ago |
| OOD Suite BBH, HumanEval, MMLU, TruthfulQA | PACE | BBH Score: 59.1 | 4 | 4d ago |