
CausalProbe

Benchmarks

Task Name                   Dataset Name    SOTA Result     Trend
Causal Question Answering   CausalProbe-H   EM 0.701        32
Causal Reasoning            CausalProbe-E   Accuracy 80.5   3