
Commonsense QA Suite (ARC-E, ARC-C, HellaS, WinoG, BoolQ, OBQA, RTE, CoPa, Race) Zero-Shot

ARC-Easy Accuracy: 81.19

Dense

[Chart: ARC-Easy Accuracy by date (May 15, 2026)]
Updated 16d ago

Evaluation Results

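| Date | ARC-E | ARC-C | HellaS | WinoG | BoolQ | OBQA | RTE | CoPa | Race | Avg |
|---|---|---|---|---|---|---|---|---|---|---|
| 2026.05 | 81.19 | 53.41 | 78.92 | 73.64 | 82.11 | 44.8 | 69.68 | 87 | 39.14 | 67.77 |
| 2026.05 | 77.65 | 53.41 | 79.16 | 72.38 | 81.35 | 45 | 69.68 | 89 | 40 | 67.51 |
| 2026.05 | 68.01 | 43 | 66.6 | 71.59 | 71.31 | 37.2 | 68.59 | 76 | 37.8 | 60.01 |
| 2026.05 | 65.87 | 42.41 | 74.35 | 67.8 | 82.91 | 41.2 | 69.68 | 89 | 41.63 | 63.87 |
| 2026.05 | 65.82 | 43.69 | 66.55 | 71.27 | 74.34 | 37.2 | 66.43 | 79 | 36.56 | 60.1 |
| 2026.05 | 64.9 | 43.52 | 63.8 | 71.67 | 69.6 | 37.8 | 71.48 | 77 | 37.32 | 59.68 |
| 2026.05 | 63.13 | 43.86 | 64.07 | 72.85 | 71.31 | 38 | 68.59 | 77 | 36.27 | 59.45 |
| 2026.05 | 61.32 | 33.11 | 59.55 | 54.22 | 43.76 | 35.2 | 51.62 | 77 | 31.29 | 49.67 |
| 2026.05 | 58.84 | 32.68 | 59.16 | 53.75 | 45.38 | 34.4 | 54.15 | 75 | 30.72 | 49.34 |
| 2026.05 | 58.29 | 42.15 | 64.96 | 68.35 | 61.99 | 34.4 | 69.68 | 80 | 34.45 | 57.14 |
| 2026.05 | 57.62 | 37.29 | 54.91 | 65.43 | 60.49 | 35.8 | 69.68 | 69 | 29.19 | 53.27 |
| 2026.05 | 57.03 | 37.29 | 52.14 | 64.25 | 39.36 | 35.8 | 62.82 | 68 | 31.1 | 49.75 |
| 2026.05 | 56.65 | 42.41 | 64.69 | 71.35 | 65.14 | 32.8 | 67.87 | 75 | 34.16 | 56.67 |
| 2026.05 | 55.81 | 37.03 | 61.3 | 67.25 | 68.47 | 37 | 72.92 | 81 | 39.43 | 57.8 |
| 2026.05 | 55.35 | 37.03 | 59.97 | 66.46 | 69.08 | 35.2 | 75.09 | 80 | 37.8 | 57.33 |
| 2026.05 | 54.08 | 36.18 | 53.29 | 64.25 | 72.97 | 33.8 | 62.82 | 72 | 32.92 | 53.59 |
| 2026.05 | 52.1 | 35.67 | 52.38 | 66.69 | 39.24 | 34.6 | 63.9 | 67 | 30.05 | 49.07 |
| 2026.05 | 51.09 | 36.09 | 53.48 | 64.17 | 58.01 | 36 | 58.84 | 74 | 33.49 | 51.69 |
| 2026.05 | 51.09 | 35.32 | 49.59 | 62.27 | 62.78 | 34.6 | 66.43 | 73 | 30.05 | 51.68 |
| 2026.05 | 50.93 | 34.98 | 47.3 | 62.19 | 74.65 | 30.4 | 71.48 | 68 | 31.58 | 52.39 |
| 2026.05 | 50.67 | 36.6 | 48.55 | 62.51 | 71.31 | 31 | 71.84 | 68 | 31 | 52.39 |
| 2026.05 | 50.51 | 34.56 | 49.9 | 63.77 | 57.49 | 33.4 | 67.15 | 68 | 30.14 | 50.55 |
| 2026.05 | 50.08 | 34.64 | 52.73 | 69.3 | 72.63 | 31.8 | 69.68 | 72 | 32.44 | 53.92 |
| 2026.05 | 49.92 | 35.24 | 46.33 | 61.56 | 51.99 | 32.4 | 57.4 | 70 | 27.27 | 48.01 |
| 2026.05 | 49.12 | 37.03 | 55.95 | 63.06 | 77.09 | 34 | 74.73 | 71 | 33.21 | 55.02 |
| 2026.05 | 48.65 | 29.27 | 49.81 | 51.78 | 61.65 | 30.6 | 50.9 | 71 | 28.23 | 46.88 |
| 2026.05 | 48.36 | 32.68 | 47.36 | 61.56 | 63.7 | 35 | 68.23 | 68 | 29.28 | 50.46 |
| 2026.05 | 48.23 | 34.56 | 53.35 | 69.77 | 75.57 | 32.4 | 65.7 | 71 | 32.34 | 53.66 |
| 2026.05 | 48.15 | 34.56 | 45.04 | 61.4 | 75.47 | 31.6 | 66.79 | 68 | 31.29 | 51.37 |
| 2026.05 | 46.34 | 35.15 | 47.72 | 61.01 | 71.74 | 31 | 70.76 | 68 | 30.91 | 51.4 |
| 2026.05 | 46.25 | 30.89 | 44.22 | 59.12 | 53.91 | 35 | 60.29 | 67 | 28.61 | 47.25 |
| 2026.05 | 45.71 | 29.69 | 55.66 | 57.7 | 64.65 | 30.4 | 50.54 | 76 | 30.62 | 49 |
| 2026.05 | 45.58 | 34.9 | 43.97 | 58.33 | 77.46 | 31.6 | 68.23 | 64 | 29.57 | 50.4 |
| 2026.05 | 44.99 | 31.06 | 43.44 | 56.12 | 53.12 | 31.8 | 60.65 | 64 | 26.79 | 45.77 |
| 2026.05 | 44.53 | 34.04 | 47.23 | 67.4 | 73.55 | 29.8 | 70.76 | 67 | 29.95 | 51.58 |
| 2026.05 | 44.19 | 33.11 | 33.39 | 56.99 | 38.2 | 32.6 | 58.12 | 61 | 25.84 | 42.6 |
| 2026.05 | 43.86 | 31.74 | 44.14 | 60.85 | 61.25 | 33.6 | 65.34 | 68 | 29.09 | 48.65 |
| 2026.05 | 43.69 | 32.76 | 44.22 | 59.19 | 77.34 | 30.4 | 70.04 | 65 | 30.72 | 50.37 |
| 2026.05 | 43.35 | 32.59 | 37.02 | 56.75 | 58.13 | 29.6 | 68.23 | 62 | 31.96 | 46.63 |
| 2026.05 | 42.97 | 29.69 | 41.64 | 58.88 | 52.84 | 33.8 | 62.09 | 70 | 27.08 | 46.55 |
| 2026.05 | 42.93 | 33.87 | 47.3 | 67.4 | 75.75 | 32.4 | 64.62 | 69 | 31.67 | 51.66 |
| 2026.05 | 42.72 | 32.59 | 48.09 | 64.33 | 75.41 | 31.8 | 74.01 | 66 | 34.07 | 52.11 |
| 2026.05 | 42.68 | 33.45 | 38.27 | 58.56 | 61.9 | 29.2 | 62.09 | 60 | 30.05 | 46.24 |
| 2026.05 | 41.75 | 28.84 | 42.84 | 54.3 | 55.23 | 27.8 | 53.79 | 70 | 26.99 | 44.62 |
| 2026.05 | 40.19 | 30.63 | 44.7 | 61.4 | 66.45 | 32 | 74.73 | 65 | 33.78 | 49.88 |
| 2026.05 | 39.69 | 28.84 | 33.18 | 55.49 | 38.07 | 29.6 | 57.4 | 60 | 24.02 | 40.7 |
| 2026.05 | 39.69 | 30.29 | 31.49 | 56.12 | 55.11 | 30.4 | 69.68 | 61 | 28.33 | 44.68 |
| 2026.05 | 39.69 | 30.29 | 31.49 | 56.12 | 55.11 | 30.4 | 69.68 | 61 | 28.33 | 44.68 |
| 2026.05 | 38.54 | 30.38 | 29.05 | 50.91 | 59.8 | 28.2 | 50.26 | 62 | 29.38 | 42.05 |
| 2026.05 | 38.51 | 27.3 | 40.5 | 53.28 | 62.08 | 29.4 | 53.79 | 57 | 25.17 | 43 |
| 2026.05 | 38.17 | 29.69 | 33.04 | 56.67 | 56.15 | 30.2 | 70.04 | 57 | 27.18 | 44.24 |
| 2026.05 | 38.17 | 29.69 | 33.04 | 56.67 | 56.15 | 30.2 | 70.04 | 57 | 27.18 | 44.24 |
| 2026.05 | 38.05 | 30.55 | 40.74 | 54.93 | 77.52 | 28.6 | 67.51 | 65 | 30.81 | 48.19 |
| 2026.05 | 37.08 | 31.66 | 38.72 | 55.17 | 75.2 | 27.4 | 64.26 | 63 | 26.22 | 46.52 |
| 2026.05 | 37.08 | 31.66 | 38.72 | 55.17 | 75.2 | 27.4 | 64.26 | 63 | 26.22 | 46.52 |
| 2026.05 | 36.45 | 29.1 | 36.32 | 50.99 | 64.62 | 27.4 | 60.29 | 56 | 24.4 | 42.84 |
| 2026.05 | 32.28 | 25 | 37.78 | 50.99 | 55.9 | 30.4 | 49.46 | 58 | 23.25 | 40.34 |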
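
The last value in each row appears to be the unweighted mean of the nine task accuracies, taken in the order listed in the suite title. This is an inference from the numbers, not something the page states. A minimal Python sketch that checks this reading against the top row follows; the names `TASKS`, `top_row`, and `macro_average` are hypothetical and exist only for illustration.

```python
# Task order as given in the suite title.
TASKS = ["ARC-E", "ARC-C", "HellaS", "WinoG", "BoolQ", "OBQA", "RTE", "CoPa", "Race"]

# Per-task accuracies of the top row (the entry with 81.19 on ARC-Easy).
top_row = [81.19, 53.41, 78.92, 73.64, 82.11, 44.8, 69.68, 87, 39.14]


def macro_average(scores):
    """Unweighted mean of per-task accuracies, rounded to two decimals.

    Assumption: every task counts equally, regardless of test-set size.
    """
    return round(sum(scores) / len(scores), 2)


if __name__ == "__main__":
    for task, score in zip(TASKS, top_row):
        print(f"{task:>6}: {score}")
    print(f"   Avg: {macro_average(top_row)}")
```

For the top row this reproduces the reported 67.77. Because the per-task scores are already rounded, a recomputed average can differ from the listed one by 0.01 on some rows.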