
Open-Critic

Benchmarks

Task Name                       Dataset Name  SOTA Result         Trend
Causal Variable Identification  Open-Critic   F1 (X): 71.7        7
Outcome Reasoning               Open-Critic   M' (F1 Mean): 75.3  7