Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multiple-choice Question Answering on MMLU Pro (Subject Accuracy)

82.8Biology Accuracy

GRPO

31.3244.68558.0571.415Apr 13, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.04
82.87963.472.573.452.569.770.5
2026.04
82.876.763.970.772.951.566.869.3
2026.04
82.678.264.37373.252.769.570.5
2026.04
81.18063.369.87350.668.869.5
2026.04
80.47261.163.770.948.166.866.1
2026.04
77.36548.450.363.235.359.156.9
2026.04
76.966.849.553.668.137.26058.9
2026.04
76.765.548.252.167.639.559.158.4
2026.04
76.763.750.352.465.136.759.957.8
2026.04
76.666.450.253.766.932.461.458.2
2026.04
73.654.441.941.551274848.2
2026.04
72.954.742.643.850.527.749.148.8
2026.04
70.351.437.442.749.130.242.846.3
2026.04
67.753.741.840.950.326.845.146.6
2026.04
65.349.635.73949.225.343.744
2026.04
48.339.822.127.440.513.226.431.1
2026.04
46.840.122.728.739.813.225.731
2026.04
44.238.221.721.141.715.927.930.1
2026.04
42.135.120.523.842.110.224.128.3
2026.04
33.32314.914.629.811.619.120.9