Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Causal Variable Identification on HumanEval Exe

71.4F1 (X)

GPT-o4

59.12862.31465.568.686May 17, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.05
71.469.26670.3
2025.05
69.366.963.267.4
2025.05
68.966.76367.5
2025.05
67.865.761.966.3
2025.05
63.761.951.959.7
2025.05
62.159.853.660.5
2025.05
59.657.349.757