Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reasoning on BBH, MMLU, ARC-C, and ThmQA (test)

64.66BBH

Teacher

1.37617.805534.23550.6645May 25, 2026
Updated 7d ago

Evaluation Results

MethodLinks
2026.05
64.6678.2289.932.4766.31
2026.05
57.7270.983.5818.157.58
2026.05
52.5766.1781.2326.1256.52
2026.05
46.5369.0581.2323.8255.16
2026.05
46.5369.0581.2323.8255.16
2026.05
45.766.8802454.1
2026.05
45.564.6878.8522.2752.83
2026.05
45.564.6878.8522.2752.83
2026.05
45.1264.9579.8122.9453.21
2026.05
45.1264.9579.8122.9453.21
2026.05
44.7164.6979.2322.2552.72
2026.05
44.7164.6979.2322.2552.72
2026.05
44.0765.6777.6824.252.9
2026.05
44.0765.6777.6824.252.9
2026.05
4465.678.122.952.6
2026.05
41.6564.4578.3323.0251.87
2026.05
41.5265.7678.7523.7552.45
2026.05
41.3964.6777.7523.7751.89
2026.05
41.3964.6777.7523.7751.89
2026.05
41.2263.9576.9123.6751.44
2026.05
41.2263.9576.9123.6751.44
2026.05
40.564.5278.1122.8351.49
2026.05
40.564.5278.1122.8351.49
2026.05
34.467.279.523.151
2026.05
28.1645.0347.5111.3733.02
2026.05
28.1645.0347.5111.3733.02
2026.05
27.4435.6437.01526.27
2026.05
26.7444.3546.7610.232.01
2026.05
26.7444.3546.7610.232.01
2026.05
26.5849.0851.6910.534.46
2026.05
26.5849.0851.6910.534.46
2026.05
26.1842.5946.711.0731.63
2026.05
26.1842.5946.711.0731.63
2026.05
26.0441.8545.8911.3331.28
2026.05
26.0441.8545.8911.3331.28
2026.05
25.8744.1247.1210.6531.94
2026.05
25.8744.1247.1210.6531.94
2026.05
25.243332.244.2723.69
2026.05
25.1833.8633.744.8524.41
2026.05
25.1634.0834.425.724.84
2026.05
25.0734.7834.95.2725.01
2026.05
25.0531.3932.344.7323.37
2026.05
24.9144.7247.441132.02
2026.05
24.9144.7247.441132.02
2026.05
24.5543.3144.7811.8231.12
2026.05
24.5543.3144.7811.8231.12
2026.05
24.4243.1946.8410.3331.2
2026.05
23.6931.6332.083.622.75
2026.05
23.5527.4128.992.6820.66
2026.05
22.3464.6178.412.2244.39
2026.05
22.0733.1333.414.3723.25
2026.05
15.2922.5423.983.8816.42
2026.05
14.0119.7821.572.2214.4
2026.05
10.2736.8340.5811.3524.76
2026.05
9.5224.7727.581.2615.78
2026.05
7.7825.9426.075.2416.26
2026.05
7.2625.4926.074.3415.79
2026.05
7.1425.2726.034.5715.75
2026.05
7.0125.0826.145.0115.81
2026.05
6.4715.9415.70.959.76
2026.05
6.2925.0725.831.2914.62
2026.05
5.9124.05254.2514.8
2026.05
5.8325.1725.933.3815.08
2026.05
5.4724.7624.074.2314.63
2026.05
5.2818.1118.093.1511.16
2026.05
3.8144.6743.727.7824.99