Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Compositional Generalization on Evaluation Dataset (Full)

0.6379Score

Gemma-7B + COGLM

0.1840440.3018720.41970.537528Jan 29, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.6379
2026.01
0.6206
2026.01
0.6075
2026.01
0.6006
2026.01
0.5963
2026.01
0.5937
2026.01
0.5906
2026.01
0.5668
2026.01
0.4405
2026.01
0.4211
2026.01
0.4016
2026.01
0.377
2026.01
0.369
2026.01
0.3569
2026.01
0.3563
2026.01
0.2982
2026.01
0.2979
2026.01
0.2015