Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multiple Choice Question Answering on OlmoBaseEval MC Non-STEM

89.3Aggregate Score

Qwen 2.5 32B

37.92451.26264.677.938Dec 15, 2025Jan 2, 2026Jan 20, 2026Feb 7, 2026Feb 25, 2026Mar 15, 2026Apr 3, 2026
Updated 11d ago

Evaluation Results

MethodLinks
2025.12
89.38588.481.289.993.386.696.886.69779.997.9
2025.12
8985.790.182.481.192.584.996.990.196.281.498.1
2025.12
87.982.788.681.980.5918194.986.597.284.697.9
2025.12
86.780.586.280.27990.381.295.884.695.98297.7
2025.12
86.183.487.479.47991.583.595.170.397.182.497.7
2025.12
85.678.38475.182.385.683.996.487.292.37898.2
2026.04
84.978.684.976.784.189.983.393.678.492.174.697.4
2025.12
84.878.684.876.884.189.983.393.778.392.374.197.5
2025.12
84.578.983.775.480.190.582.493.97195.38197.6
2025.12
84.279.784.575.681.287.782.394.468.696.678.697.4
2026.04
84.178.784.476.378.187.881.594.481.391.873.397.5
2026.04
83.980.585.17875.39080.792.27094.979.197.3
2025.12
83.579.384.976.378.687.381.29264.895.382.496.7
2025.12
83.279.385.876.978.1898194.366.69274.597.5
2025.12
82.976.28374.48588.582.993.569.192.170.596.4
2025.12
81.37882.273.874.48678.792.27090.771.197.4
2025.12
81.374.582.974.275.385.780.392.765.892.872.597.3
2026.04
80.776.880.573.275.985.978.691.660.992.57695.8
2025.12
80.573.680.872.776.187.280.791.464.189.572.296.7
2026.04
80.471.679.77178.482.781.193.968.189.37197
2025.12
78.871.477.468.375.385.779.886.263.790.871.596.5
2025.12
78.574.179.270.176.97979.387.556.593.271.995.7
2025.12
78.268.97566.975.380.280.392.567.386.969.496.9
2026.04
78.269.275.266.875.280.280.392.567.486.969.597
2025.12
76.967.671.864.582.381.583.187.65588.469.294.5
2026.04
76.27973.478.252.882.878.891.271.954.178.197.3
2025.12
76.170.175.569.172.978.37789.953.388.96894.4
2025.12
75.267.973.165.27280.177.58555.689.566.395.3
2025.12
74.267.874.766.172.180.576.382.847.590.366.791.3
2026.04
74.265.870.864.17383.577.586.551.490.267.785.4
2026.04
71.162.168.463.967.876.475.583.348.783.964.287.6
2025.12
6559.560.857.265.571.673.459.744.883.251.387.7
2025.12
64.156.758.955.460.67271.367.3487747.590
2026.04
39.937.639.538.732.551.534.133.633.946.734.356.2