Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Outcome Reasoning on CLOMO

90.2M' F1 Mean

GPT-5

66.2872.4978.784.91May 17, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.05
90.285.3
2025.05
88.783.9
2025.05
82.977.2
2025.05
80.574.3
2025.05
79.372.8
2025.05
77.871.6
2025.05
67.260.9