Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Commonsense Reasoning on CoS-E v1.11
Loading...
0.608
Score (Finetune+Baseline vs Predict+Baseline)
T5-base
0.4364
0.48095
0.5255
0.57005
May 4, 2023
Score (Finetune+Baseline vs Predict+Baseline)
Score (Finetune+Baseline vs Predict+Infusion)
Simulatability Score
Score (Finetune+Infusion vs Predict+Infusion)
TREU Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score (Finetune+Baseline vs Predict+Baseline)
Score (Finetune+Baseline vs Predict+Infusion)
Simulatability Score
Score (Finetune+Infusion vs Predict+Infusion)
TREU Score
T5-base
Backbone=T5-base
2023.05
0.608
0.61
0.002
0.803
0.197
BART-base
Backbone=BART-base
2023.05
0.443
0.449
0.006
0.7
0.263
Feedback
Search any
task
Search any
task