Generative Commonsense Reasoning on ComVE
Chart (Win-Tie Score): CommonSyn, 78.9, Mar 18, 2026
Updated 1mo ago
Evaluation Results
Method     Method type                Date     Win-Tie Score  S-B4 Score  Vendi Score  S-Cos Score
CommonSyn  Model training data/so...  2026.03  78.9           85.9        18.1         30
Vanilla    Model training data/so...  2026.03  75.3           77.3        17.7         29.8
CommonGen  Model training data/so...  2026.03  41.7           86.3        21.6         44.9