
CausalProbe

Benchmarks

Task Name                   Dataset Name    SOTA Result      Trend
Causal Question Answering   CausalProbe-H   EM 0.701         32
Causal Reasoning            CausalProbe-E   Accuracy 80.5    3