Outcome Reasoning on CVQA-Bool

81.2 M' (F1 Score)

GPT-5

[Chart: score over time; data point dated May 17, 2025]
Updated 4d ago

Evaluation Results

Method    Links
2025.05   81.2   74.5
2025.05   77.1   70.2
2025.05   70.9   64.3
2025.05   66.7   59.8
2025.05   65.4   58.6
2025.05   63.2   56.8
2025.05   54.8   48.1