Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Commonsense Reasoning on Commonsense QA
Loading...
3.9
Average Relative Improvement
TBDF
-0.2496
0.8277
1.905
2.9823
Jan 29, 2026
Average Relative Improvement
Inferior/Superior Case Counts
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Relative Improvement
Inferior/Superior Case Counts
TBDF
Filtering Mode=General...
2026.01
3.9
-
TBDF
Filtering Mode=FW-EDU,...
2026.01
3.41
-
CB
Filtering Mode=FW-EDU,...
2026.01
-0.09
-
Feedback
Search any
task
Search any
task