Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Compositional Reasoning Dataset

43.3Correction Score (C)

CREME

-0.785610.659722.10533.5503Feb 22, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.02
43.323.713.611.24
2024.02
177.991.270.86
2024.02
7.2713.50.6
2024.02
3.22.313.10.3
2024.02
2.210.30.3226.72
2024.02
1.2---
2024.02
0.980.450.752.93
2024.02
0.91---