
Commonsense Reasoning on BoolQ, PIQA, SIQA, HellaS., WinoG., ARC-e, ARC-c, OBQA

89.69 BoolQ Accuracy

LoRA-GA

[Chart: BoolQ accuracy over time, May 31, 2024 to Apr 30, 2026]
Updated 22h ago

Evaluation Results

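| Date | BoolQ | PIQA | SIQA | HellaS. | WinoG. | ARC-e | ARC-c | OBQA | Avg. | |
|---|---|---|---|---|---|---|---|---|---|---|
| 2025.10 | 89.69 | 84.98 | 61 | 86.58 | 85.32 | 86.11 | 62.29 | 61.8 | 77.22 | - |
| 2025.10 | 89.69 | 85.47 | 61.72 | 86.76 | 83.35 | 87.08 | 64.08 | 62.2 | 77.54 | - |
| 2025.10 | 89.2 | 85.64 | 60.13 | 85.99 | 85.24 | 86.95 | 63.14 | 61.8 | 77.26 | - |
| 2025.10 | 89.2 | 86.18 | 61.82 | 86.51 | 84.53 | 86.57 | 65.61 | 62.4 | 77.85 | - |
| 2025.10 | 89.14 | 86.07 | 62.33 | 86.48 | 83.35 | 86.53 | 64.68 | 62 | 77.57 | - |
| 2025.10 | 88.99 | 85.09 | 60.95 | 86.09 | 82.64 | 86.62 | 62.29 | 62 | 76.83 | - |
| 2025.10 | 88.87 | 86.07 | 60.64 | 86.11 | 84.53 | 87.12 | 63.91 | 62.4 | 77.46 | - |
| 2025.10 | 88.56 | 86.18 | 60.29 | 86.69 | 82.4 | 87.79 | 64.08 | 62.2 | 77.27 | - |
| 2025.10 | 88.29 | 82.7 | 60.54 | 83.15 | 82 | 82.79 | 57.68 | 59 | 74.52 | - |
| 2025.10 | 87.92 | 83.03 | 60.13 | 83.3 | 82.87 | 83.25 | 56.83 | 58.4 | 74.34 | - |
| 2025.10 | 87.8 | 82.48 | 60.08 | 83.23 | 82.56 | 82.95 | 58.11 | 58 | 74.4 | - |
| 2025.10 | 87.77 | 82.43 | 60.08 | 83.43 | 82.08 | 83.54 | 58.11 | 58.6 | 74.51 | - |
| 2025.10 | 87.71 | 82.97 | 59.83 | 83.38 | 81.69 | 82.83 | 55.55 | 57.6 | 73.95 | - |
| 2025.10 | 87.58 | 82.26 | 60.49 | 83.52 | 81.69 | 83.75 | 58.53 | 60.2 | 74.75 | - |
| 2025.10 | 87.49 | 82.54 | 59.88 | 82.56 | 79.08 | 83.59 | 58.02 | 57.4 | 73.82 | - |
| 2025.10 | 87.4 | 81.66 | 59.16 | 82.45 | 79.48 | 82.91 | 57.59 | 58.4 | 73.63 | - |
| 2026.04 | 84.4 | 74.8 | 50.2 | 48.1 | 74.2 | 58.2 | 47.6 | 45.4 | 60.4 | - |
| 2026.04 | 83.6 | 73.2 | 49.8 | 47.8 | 74 | 54.4 | 43.6 | 43.2 | 58.7 | - |
| 2026.04 | 83 | 67.6 | 51.8 | 46.8 | 70.4 | 46.7 | 43.2 | 40.4 | 56.2 | - |
| 2026.03 | 82.88 | 78.67 | 51.85 | 76.3 | 74.27 | 81 | 49.79 | 45.2 | 67.49 | - |
| | 82.77 | 79.03 | 54.48 | 77.54 | 75.81 | 82.16 | 51.37 | 49 | 69.02 | - |
| 2026.03 | 82.1 | 78.35 | 53.23 | 76.63 | 74.3 | 80.52 | 50.64 | 47.7 | 67.93 | - |
| 2026.03 | 81.89 | 77.15 | 48.56 | 76.12 | 69.61 | 78.87 | 50.68 | 42.2 | 65.64 | - |
| 2026.04 | 81.8 | 68.2 | 52.6 | 47.3 | 70.2 | 58.9 | 42.8 | 39.2 | 57.6 | - |
| 2026.04 | 81.2 | 70.3 | 48.4 | 46.5 | 71.6 | 52.8 | 43.4 | 42.4 | 57.1 | - |
| 2026.04 | 80.4 | 69.1 | 50.4 | 45.9 | 70.2 | 56.1 | 43.2 | 41.8 | 57.1 | - |
| | 80.34 | 79.54 | 50.72 | 78.37 | 73.72 | 79.76 | 49.32 | 45.2 | 67.12 | - |
| 2026.03 | 80.02 | 77.4 | 51.9 | 72.3 | 72.57 | 79.55 | 47.99 | 47.9 | 66.2 | - |
| 2026.03 | 76.53 | 77.15 | 48.72 | 73.54 | 70.72 | 76.64 | 43.43 | 42.1 | 63.6 | - |
| 2025.06 | 76.4 | 89.7 | 82.5 | 95.5 | 89.6 | 92.9 | 84.3 | 89.2 | 87.5 | - |
| 2025.06 | 76.1 | 90.3 | 81.5 | 95.7 | 89.7 | 93.4 | 83.4 | 87.2 | 87.2 | - |
| 2025.06 | 75.8 | 90.8 | 80.7 | 95.6 | 89.9 | 93.4 | 83 | 89.2 | 87.3 | - |
| 2025.06 | 75.7 | 88.4 | 81.4 | 96.2 | 88.2 | 92.7 | 83.2 | 88.6 | 86.8 | - |
| 2025.06 | 75.7 | 84.8 | 82 | 90.6 | 86.8 | 90.1 | 77.1 | 89.2 | 84.5 | - |
| 2025.06 | 75.7 | 90.5 | 83.2 | 96.5 | 89.4 | 93.6 | 83.9 | 90.2 | 87.88 | - |
| 2026.03 | 75.4 | 89.7 | 81.2 | 95.4 | 87.7 | 93.3 | 82.9 | 88.3 | 86.7 | - |
| 2025.06 | 75.4 | 88 | 81.8 | 96.5 | 89.3 | 93.1 | 83 | 86 | 86.64 | - |
| 2025.06 | 75.1 | 89.9 | 82.4 | 96.3 | 88.8 | 92.6 | 82.8 | 89.6 | 87.2 | - |
| 2025.09 | 74.89 | 92.76 | 82.62 | 95.5 | 89.24 | 97.43 | 92.19 | 91.41 | 89.5 | - |
| 2025.09 | 74.86 | 92.82 | 82.75 | 95.74 | 89.15 | 97.58 | 92.24 | 91.6 | 89.59 | - |
| 2025.06 | 74.8 | 85.5 | 80.9 | 89.9 | 85.7 | 88.5 | 76.3 | 86 | 83.5 | - |
| 2025.06 | 74.8 | 86.2 | 80.5 | 90.5 | 87.1 | 88.6 | 75.4 | 88 | 83.9 | - |
| 2025.06 | 74.8 | 84.7 | 82.2 | 94.4 | 86 | 89.2 | 76.4 | 89.6 | 84.66 | - |
| 2025.06 | 74.7 | 86.9 | 81.2 | 91.5 | 87.4 | 89.8 | 78.2 | 90.2 | 84.9 | - |
| 2024.05 | 74.6 | 89.3 | 79.9 | 95.5 | 85.6 | 90.5 | 80.4 | 85.8 | 85.2 | - |
| 2024.08 | 74.6 | 89.3 | 79.9 | 95.5 | 85.6 | 90.5 | 80.4 | 85.8 | 85.2 | - |
| 2024.08 | 74.6 | 89.8 | 81.6 | 96 | 86.9 | 92.8 | 82.1 | 86.8 | 86.3 | - |
| 2025.06 | 74.6 | 89.3 | 79.9 | 95.5 | 85.6 | 90.5 | 80.4 | 85.8 | 85.2 | - |
| 2026.03 | 74.6 | 89.3 | 79.9 | 95.5 | 85.6 | 90.5 | 80.4 | 85.8 | 85.2 | - |
| 2025.06 | 74.6 | 87.4 | 81.2 | 94.7 | 87.1 | 89.4 | 79.5 | 86.4 | 85.04 | - |
| 2024.05 | 74.5 | 88.8 | 80.3 | 95.5 | 84.7 | 90.1 | 79.1 | 87.2 | 85 | - |
| 2024.08 | 74.5 | 88.8 | 80.3 | 95.5 | 84.7 | 90.1 | 79.1 | 87.2 | 85 | - |
| 2025.06 | 74.5 | 88.8 | 80.3 | 95.5 | 84.7 | 90.1 | 79.1 | 87.2 | 85 | - |
| 2026.03 | 74.5 | 88.8 | 80.3 | 95.5 | 84.7 | 90.1 | 79.1 | 87.2 | 85 | - |
| 2025.09 | 74.47 | 87.95 | 80.67 | 94.88 | 85.72 | 89.5 | 79.69 | 85.81 | 84.84 | -0.001 |
| 2024.08 | 74.4 | 89.8 | 81.1 | 96.2 | 87.8 | 92.9 | 83 | 86.8 | 86.5 | - |
| 2025.09 | 74.38 | 88.24 | 80.17 | 94.85 | 85.68 | 89.69 | 79.4 | 86.1 | 84.81 | 0.04 |
| 2024.05 | 74.3 | 87.5 | 80.9 | 94.5 | 86.7 | 92.1 | 81.5 | 85.8 | 85.4 | - |
| 2024.05 | 74.3 | 88.1 | 81.8 | 95.1 | 87.3 | 91.1 | 81.7 | 87.2 | 85.8 | - |
| 2025.09 | 74.17 | 87.48 | 79.61 | 94.78 | 85.06 | 89.09 | 79.3 | 85.19 | 84.34 | 0.6 |
| 2025.06 | 73.9 | 84.8 | 81.7 | 90 | 85 | 87.9 | 76.8 | 86.8 | 83.4 | - |
| 2025.06 | 73.8 | 84.2 | 81 | 94.7 | 85.2 | 88.9 | 75.6 | 84.8 | 83.53 | - |
| 2026.03 | 73.6 | 89.1 | 80.8 | 94.8 | 85.7 | 93.1 | 83 | 87.6 | 86 | - |
| 2024.08 | 73.5 | 89 | 81.4 | 96 | 87.6 | 92.9 | 82.4 | 87.2 | 86.3 | - |
| 2025.09 | 73.39 | 89.25 | 80.69 | 95.4 | 85.35 | 90.81 | 77.76 | 86.6 | 84.9 | - |
| 2025.09 | 73.34 | 86.24 | 80.31 | 93.97 | 83.57 | 88.01 | 76.53 | 83.8 | 83.22 | 1.92 |
| 2025.06 | 73.3 | 85.7 | 81 | 90.2 | 86.9 | 88.6 | 77.4 | 85.2 | 83.5 | - |
| 2025.06 | 73.3 | 83.7 | 81 | 94.3 | 84.6 | 88.3 | 75.8 | 84.8 | 83.22 | - |
| 2026.03 | 73.1 | 85.4 | 68.5 | 78.5 | 66.1 | 89.8 | 79.9 | 74.8 | 77 | - |
| 2024.08 | 73 | 83.9 | 80.2 | 93.2 | 83 | 86.5 | 74.4 | 83 | 82.2 | - |
| 2024.05 | 72.9 | 87.1 | 80.6 | 92.1 | 85.1 | 87.8 | 76 | 84.3 | 83.2 | - |
| 2026.04 | 72.87 | 86.28 | 82.59 | 95.21 | 86.58 | 88.59 | 76.36 | 85.2 | 84.21 | - |
| 2025.05 | 72.84 | 87.98 | 77.79 | 92.82 | 79.4 | 93.14 | 83.62 | 88.2 | 84.47 | - |
| 2026.04 | 72.62 | 85.03 | 81.52 | 94.81 | 85.79 | 88.29 | 75.42 | 85.4 | 83.61 | - |
| 2024.08 | 72.6 | 83.8 | 80 | 93.3 | 83 | 87.1 | 73.7 | 84.8 | 82.3 | - |
| 2025.06 | 72.6 | 85.2 | 82 | 94.4 | 85.7 | 87.8 | 74.5 | 85 | 83.4 | - |
| 2025.06 | 72.5 | 85.3 | 79.9 | 90.1 | 82.9 | 82.7 | 69.7 | 83.6 | 80.8 | - |
| 2025.06 | 72.5 | 85.3 | 80.8 | 87.2 | 86.1 | 87.1 | 74.3 | 85.6 | 82.36 | - |
| 2026.04 | 72.38 | 85.96 | 81.52 | 94.86 | 86.5 | 88.88 | 74.06 | 84.4 | 83.57 | - |
| 2025.05 | 72.32 | 87.32 | 76.86 | 91.07 | 81.76 | 92.46 | 82.87 | 89 | 84.19 | - |
| 2026.03 | 72.3 | 87.6 | 81.5 | 94.3 | 87 | 91.5 | 79.1 | 83.8 | 84.7 | - |
| 2026.04 | 72.14 | 83.4 | 80.5 | 94.17 | 85.79 | 87.24 | 73.37 | 81.2 | 82.23 | - |
| 2025.06 | 72.1 | 83.5 | 80.5 | 90.5 | 83.7 | 82.8 | 68.3 | 82.4 | 80.5 | - |
| 2024.08 | 72 | 83.1 | 79.9 | 89.1 | 83 | 84.5 | 71 | 81.2 | 80.5 | - |
| 2025.06 | 72 | 83.1 | 79.9 | 89.1 | 83 | 84.5 | 71 | 81.2 | 80.5 | - |
| 2025.06 | 72 | 83.8 | 80.8 | 93.3 | 82.8 | 86.7 | 74 | 81 | 81.8 | - |
| 2026.03 | 72 | 83.1 | 79.9 | 89.1 | 83 | 84.5 | 71 | 81.2 | 80.5 | - |
| 2025.05 | 71.9 | 86.96 | 76.28 | 91.55 | 78.76 | 92.8 | 83.11 | 85.4 | 83.35 | - |
| 2026.04 | 71.89 | 85.03 | 80.96 | 93.91 | 85.47 | 86.78 | 73.72 | 86.4 | 83.02 | - |
| 2024.08 | 71.8 | 83.7 | 76 | 89.1 | 82.6 | 83.7 | 68.2 | 82.4 | 79.7 | - |
| 2025.06 | 71.8 | 83.7 | 76 | 89.1 | 82.6 | 83.7 | 68.2 | 82.4 | 79.7 | - |
| 2026.03 | 71.8 | 83.7 | 76 | 89.1 | 82.6 | 83.7 | 68.2 | 82.4 | 79.7 | - |
| 2025.06 | 71.8 | 85.3 | 80.9 | 93.4 | 84.5 | 90 | 77 | 84.8 | 83.46 | - |
| 2024.10 | 71.71 | 82.55 | 78.88 | 91.6 | 83.01 | 83.04 | 67.33 | 81.76 | - | - |
| 2024.08 | 71.7 | 83 | 80.1 | 93 | 81.2 | 86 | 72.3 | 82.2 | 81.2 | - |
| 2025.05 | 71.55 | 87.95 | 77.27 | 91.8 | 79.71 | 92.67 | 82.16 | 86.4 | 83.69 | - |
| 2024.10 | 71.46 | 83.32 | 79.54 | 91.86 | 83.22 | 83.65 | 67.12 | 81.54 | - | - |
| 2025.05 | 71.46 | 87.59 | 76.35 | 92.11 | 78.29 | 92 | 80.63 | 85.6 | 83 | - |
| 2024.10 | 71.44 | 83.52 | 79.5 | 91.84 | 83.2 | 83.39 | 67.06 | 81.73 | - | - |
| 2024.10 | 71.39 | 83.33 | 78.32 | 92.4 | 83.24 | 83.34 | 66.43 | 80.99 | - | - |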
Showing 100 of 223 rows