Commonsense Reasoning on BoolQ, PIQA, SIQA, HellaS., WinoG., ARC-e, ARC-c, OBQA

Top BoolQ Accuracy: 82.88 (FFA-LoRA)

[Chart: BoolQ Accuracy over time, May 31, 2024 to Apr 9, 2026]
Updated 8d ago

Evaluation Results

Date     BoolQ   PIQA    SIQA    HellaS. WinoG.  ARC-e   ARC-c   OBQA    Avg.
2026.03  82.88   78.67   51.85   76.3    74.27   81      49.79   45.2    67.49
-        82.77   79.03   54.48   77.54   75.81   82.16   51.37   49      69.02
2026.03  82.1    78.35   53.23   76.63   74.3    80.52   50.64   47.7    67.93
2026.03  81.89   77.15   48.56   76.12   69.61   78.87   50.68   42.2    65.64
-        80.34   79.54   50.72   78.37   73.72   79.76   49.32   45.2    67.12
2026.03  80.02   77.4    51.9    72.3    72.57   79.55   47.99   47.9    66.2
2026.03  76.53   77.15   48.72   73.54   70.72   76.64   43.43   42.1    63.6
2025.06  76.4    89.7    82.5    95.5    89.6    92.9    84.3    89.2    87.5
2025.06  76.1    90.3    81.5    95.7    89.7    93.4    83.4    87.2    87.2
2025.06  75.8    90.8    80.7    95.6    89.9    93.4    83      89.2    87.3
2025.06  75.7    88.4    81.4    96.2    88.2    92.7    83.2    88.6    86.8
2025.06  75.7    84.8    82      90.6    86.8    90.1    77.1    89.2    84.5
2026.03  75.4    89.7    81.2    95.4    87.7    93.3    82.9    88.3    86.7
2025.06  75.1    89.9    82.4    96.3    88.8    92.6    82.8    89.6    87.2
2025.06  74.8    85.5    80.9    89.9    85.7    88.5    76.3    86      83.5
2025.06  74.8    86.2    80.5    90.5    87.1    88.6    75.4    88      83.9
2025.06  74.7    86.9    81.2    91.5    87.4    89.8    78.2    90.2    84.9
2024.05  74.6    89.3    79.9    95.5    85.6    90.5    80.4    85.8    85.2
2024.08  74.6    89.3    79.9    95.5    85.6    90.5    80.4    85.8    85.2
2024.08  74.6    89.8    81.6    96      86.9    92.8    82.1    86.8    86.3
2025.06  74.6    89.3    79.9    95.5    85.6    90.5    80.4    85.8    85.2
2026.03  74.6    89.3    79.9    95.5    85.6    90.5    80.4    85.8    85.2
2024.05  74.5    88.8    80.3    95.5    84.7    90.1    79.1    87.2    85
2024.08  74.5    88.8    80.3    95.5    84.7    90.1    79.1    87.2    85
2025.06  74.5    88.8    80.3    95.5    84.7    90.1    79.1    87.2    85
2026.03  74.5    88.8    80.3    95.5    84.7    90.1    79.1    87.2    85
2024.08  74.4    89.8    81.1    96.2    87.8    92.9    83      86.8    86.5
2024.05  74.3    87.5    80.9    94.5    86.7    92.1    81.5    85.8    85.4
2024.05  74.3    88.1    81.8    95.1    87.3    91.1    81.7    87.2    85.8
2025.06  73.9    84.8    81.7    90      85      87.9    76.8    86.8    83.4
2026.03  73.6    89.1    80.8    94.8    85.7    93.1    83      87.6    86
2024.08  73.5    89      81.4    96      87.6    92.9    82.4    87.2    86.3
2025.06  73.3    85.7    81      90.2    86.9    88.6    77.4    85.2    83.5
2026.03  73.1    85.4    68.5    78.5    66.1    89.8    79.9    74.8    77
2024.08  73      83.9    80.2    93.2    83      86.5    74.4    83      82.2
2024.05  72.9    87.1    80.6    92.1    85.1    87.8    76      84.3    83.2
2024.08  72.6    83.8    80      93.3    83      87.1    73.7    84.8    82.3
2025.06  72.6    85.2    82      94.4    85.7    87.8    74.5    85      83.4
2025.06  72.5    85.3    79.9    90.1    82.9    82.7    69.7    83.6    80.8
2026.03  72.3    87.6    81.5    94.3    87      91.5    79.1    83.8    84.7
2025.06  72.1    83.5    80.5    90.5    83.7    82.8    68.3    82.4    80.5
2024.08  72      83.1    79.9    89.1    83      84.5    71      81.2    80.5
2025.06  72      83.1    79.9    89.1    83      84.5    71      81.2    80.5
2025.06  72      83.8    80.8    93.3    82.8    86.7    74      81      81.8
2026.03  72      83.1    79.9    89.1    83      84.5    71      81.2    80.5
2024.08  71.8    83.7    76      89.1    82.6    83.7    68.2    82.4    79.7
2025.06  71.8    83.7    76      89.1    82.6    83.7    68.2    82.4    79.7
2026.03  71.8    83.7    76      89.1    82.6    83.7    68.2    82.4    79.7
2024.10  71.71   82.55   78.88   91.6    83.01   83.04   67.33   81.76   -
2024.08  71.7    83      80.1    93      81.2    86      72.3    82.2    81.2
2024.10  71.46   83.32   79.54   91.86   83.22   83.65   67.12   81.54   -
2024.10  71.44   83.52   79.5    91.84   83.2    83.39   67.06   81.73   -
2024.10  71.39   83.33   78.32   92.4    83.24   83.34   66.43   80.99   -
2026.03  71.2    83.4    79.5    88.1    84      86.7    73.8    84.6    81.4
2024.10  71.19   83.99   79.15   91.86   83.24   83.35   67.05   81.37   -
2026.04  71.16   80.96   75.13   85.88   71.35   86.28   72.27   79.4    77.8
2024.06  70.9    81.9    78.6    84.5    80.8    81.7    64.9    79.2    77.8
2024.05  70.8    85.2    79.9    91.7    84.3    84.2    71.2    79      80.8
2024.08  70.8    85.2    79.9    91.7    84.3    84.2    71.2    79      80.8
2025.06  70.8    85.2    79.9    91.7    84.3    84.2    71.2    79      80.8
2026.03  70.8    85.2    79.9    91.7    84.3    84.2    71.2    79      80.8
2024.08  69.8    79.9    79.5    83.6    82.6    79.8    64.7    81      77.6
2025.06  69.8    79.9    79.5    83.6    82.6    79.8    64.7    81      77.6
2026.03  69.8    79.9    79.5    83.6    82.6    79.8    64.7    81      77.6
2026.04  69.73   79.92   74.96   82.93   70.43   82.7    66.21   77.8    75.59
2024.06  69.6    81.6    77.8    79.6    79.8    81.5    66.5    79.4    77
2026.03  69.5    81.8    80.4    91.7    83.6    84.6    69.2    78.2    80
2024.10  69.36   80.01   78.09   89.28   76.73   76.46   60.55   76.96   -
2024.10  69.32   80.08   77.99   89.46   76.41   76.46   60.59   76.9    -
2026.04  68.93   80.06   72.42   83.51   68.59   83.59   67.66   75.6    75.05
2024.06  68.9    80.7    77.4    78.1    78.8    77.8    61.3    74.8    74.7
2026.03  68.9    81.4    80.4    91      80.1    84.1    69.1    73.6    78.6
2026.04  68.87   80.09   74.31   83.71   71.35   84.72   69.11   78.2    76.3
2025.06  68.8    86.7    77.2    92.9    85.6    86.8    75.5    81.8    81.9
2026.04  68.53   79.65   73.13   83.37   72.45   85.27   69.2    77.6    76.15
2024.06  68.5    82.9    79.6    84.8    80.8    81.4    65.8    81      78.1
2024.06  68.4    80.9    79.4    80.3    80.4    80.2    64.7    78.2    76.6
2024.10  68.38   80.99   77.8    80      76.35   77.62   61.32   72.94   -
2024.06  68.3    80.6    79.1    82.1    80      81.5    67.9    79.6    77.4
2026.04  68.2    78.73   73.08   84.19   71.64   82.87   68.98   76.6    75.54
2025.06  67.6    78.1    78.4    76.6    78      75.8    60.2    75.6    73.8
2025.06  67.6    83.8    80.1    88.2    82      82.8    68.8    80.6    79.2
2026.04  67.4    78.35   74.05   83.1    71.59   84.97   69.8    78      75.91
2024.10  67.33   79.46   75.8    76.04   72.11   71.67   57.33   69.98   -
2025.06  67.1    81.1    77.2    83.6    78.9    77.7    63.2    74.6    75.4
2024.10  67.09   79.37   76.15   88.86   77.54   76.54   60.55   74.63   -
2024.10  67.03   78.69   76.06   88.85   76.47   76.5    60.36   74.22   -
2026.04  66.7    78.35   73.85   82.9    70.24   84.85   68.03   77.4    75.29
2024.10  66.34   74.64   73.12   55.93   72.5    73.11   57.19   72.9    -
2024.10  66.33   74.53   73.56   56.6    72.14   73.29   57.48   72.92   -
2024.10  65.92   68.53   69.9    45.97   66.87   64.91   45.12   65.07   -
2024.10  65.89   73.92   73.33   56.65   71.39   73.46   57.15   72.31   -
2024.10  65.5    67.63   69.46   45.6    66.8    63.56   46.81   63.82   -
2026.02  65.35   81.28   69.89   81.93   65.67   86.59   72.21   75.33   74.78
2024.10  65.19   67.58   71.22   45.16   66.03   64.1    47.75   63.92   -
2024.10  65.12   68.22   69.96   45.98   66.78   64.89   45.7    64.81   -
2024.10  65.02   78.1    78      87.57   76.78   75.48   60.54   74.02   -
2024.10  64.94   74.68   72.49   55.89   68.3    73.21   56.59   72.85   -
2026.02  64.57   81.07   69.74   81.09   64.09   86.46   72.01   76.33   74.42
2026.04  64.46   76.82   66.02   73.61   60.46   80.01   64.68   70      69.51
Showing 100 of 129 rows