Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Counterfactual Reasoning on CRAFT Hard Split (test)
Loading...
83.64
Accuracy
CRCG_GPT4
51.2856
59.6853
68.085
76.4847
Jun 12, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
CRCG_GPT4
Learning Protocol=Few-...
2025.06
83.64
GPT-4 with CRCG guided prompt
Learning Protocol=Few-...
2025.06
81.22
GPT-4
Learning Protocol=Few-...
2025.06
81.2
BERT-D
Learning Protocol=Supe...
2025.06
79.34
CRCG_GPT3.5
Learning Protocol=Few-...
2025.06
68.48
GPT-3.5 with CRCG guided prompt
Learning Protocol=Few-...
2025.06
65.89
LSTM-D
Learning Protocol=Supe...
2025.06
56
GPT-3.5
Learning Protocol=Few-...
2025.06
52.53
Feedback
Search any
task
Search any
task