Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Commonsense Validation and Explanation on ComVE
Loading...
0.88
Performance (F+B -> P+B)
T5-base
0.80928
0.82764
0.846
0.86436
May 4, 2023
Performance (F+B -> P+B)
Performance (F+B -> P+I)
Simulatability Score
Performance (F+I -> P+I)
TREU Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Performance (F+B -> P+B)
Performance (F+B -> P+I)
Simulatability Score
Performance (F+I -> P+I)
TREU Score
T5-base
Backbone=T5-base
2023.05
0.88
0.527
-0.353
0.949
-0.284
BART-base
Backbone=BART-base
2023.05
0.812
0.596
-0.216
0.864
-0.164
Feedback
Search any
task
Search any
task