Causal Reasoning on XCOPA (test)

97.2 Accuracy (id)

PaLM 2

[Chart: Accuracy (id) over time, Apr 30, 2020 to May 30, 2024]
Updated 4d ago

Evaluation Results

Method  Links
2023.05  97.2  94.4  -     97.6  91.4  98.4  76.8  92.8  96.2  97.8  96.4  96.8  97.4
2023.05  94    89.9  -     91    89.6  97.4  66.8  85.4  90.8  94.6  90.2  94.6  94.8
2023.05  92.6  83.7  -     77.4  78    96    61    69.4  85.4  92.8  87.2  89.8  91.6
2023.05  92.2  83    -     75.6  77.2  95.8  60.6  68.8  84    92.4  86.8  89.4  90.6
2020.04  71    61.2  66.8  59.4  50    61.6  46    58.8  60    63.2  62.2  67.6  67.4
2020.04  65.8  61.5  68.3  61.3  53.7  63    52.5  56.3  61.9  61.8  60.3  66.1  67.6
2020.04  65    59.7  66.8  58    51.4  60.2  51.2  52    58.4  62    56.6  65.6  68.8
2024.05  62.8  -     -     -     -     -     -     53.8  -     55.8  56.8  62.2  -
2024.05  62    -     -     -     -     -     -     58.8  -     56.6  57    62.8  -
2024.05  61.6  -     -     -     -     -     -     52.6  -     55    56.2  60.6  -
2024.05  61.4  -     -     -     -     -     -     56    -     55.6  57.4  60.4  -
2024.05  60.2  -     -     -     -     -     -     53.2  -     53.4  57.4  59.4  -
2024.05  58.2  -     -     -     -     -     -     51.8  -     53    57.2  57    -
2025.02  -     84.4  -     -     -     -     -     -     -     -     -     -     -
2025.02  -     88.6  -     -     -     -     -     -     -     -     -     -     -
2025.02  -     89.2  -     -     -     -     -     -     -     -     -     -     -