Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multilingual Commonsense Reasoning on X-CSQA

45.5Accuracy (SW)

MINDMERGER-SOFT

23.34829.09934.8540.601Oct 16, 2025
Updated 15d ago

Evaluation Results

MethodLinks
2025.10
45.546.248.451.460.653.963.362.963.866.86767.168.169.175.278.161
2025.10
40.852.755.663.96561.364.268.367.966.369.770.87270.774.684.265.5
2025.10
36.541.348.444.651.847.153.351.55556.357.354.757.255.571.371.352.3
2025.10
35.751.652.860.96359.362.566.664.764.167.666.968.269.870.182.862.9
2025.10
35.132.637.836.350.549.257.154.856.358.358.358.859.860.363.175.752.3
2025.10
33.129.940.437.752.949.954.755.45859.758.661.962.563.675.275.253.1
2025.10
31.830.530.630.633.333.939.839.838.439.137.436.433.838.238.844.436.1
2025.10
27.629.23228.738.838.745.543.845.946.550.249.151.252.154.367.243.8
2025.10
25.13239.242.256.655.960.662.261.362.866.364.966.267.467.779.356.9
2025.10
24.225.132.932.350.949.150.656.557.5565661.261.763.56476.351.3