Commonsense Reasoning on Winogrande (Relative Improvement Metrics)
[Chart: Avg Relative Improvement (alternate view: Inferior/Superior Counts). TBDF: 2.79, Jan 29, 2026. Updated 4d ago.]
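Evaluation Results

| Method | Setting | Date | Avg Relative Improvement | Inferior/Superior Counts |
|--------|---------|------|--------------------------|--------------------------|
| TBDF | Filtering Mode=General... | 2026.01 | 2.79 | - |
| TBDF | Filtering Mode=FW-EDU,... | 2026.01 | 1.43 | - |
| CB | Filtering Mode=FW-EDU,... | 2026.01 | -0.25 | - |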