Instruction Following on InstructBench

75.27 Dolly (BLEU)

SYTTA-8

Oct 11, 2025
Updated 23d ago

Evaluation Results

Method | Links
2025.10 | 75.27 | 75.4 | 72.14 | 74.27
2025.10 | 74.98 | 75.57 | 72.14 | 74.23
2025.10 | 74.61 | 75.9 | 71.98 | 74.17
2025.10 | 74.56 | 75.85 | 71.96 | 74.12
2025.10 | 74.55 | 75.6 | 72.43 | 74.19
2025.10 | 74.44 | 75.71 | 72.53 | 74.23
2025.10 | 74.33 | 75.75 | 71.77 | 73.95
2025.10 | 74.33 | 76.26 | 71.53 | 74.04
2025.10 | 74.22 | 76.05 | 72.03 | 74.1
2025.10 | 74.1 | 75.3 | 71.87 | 73.76
2025.10 | 74.04 | 75.83 | 71.64 | 73.84
2025.10 | 73.99 | 74.48 | 72.94 | 73.81
2025.10 | 73.98 | 74.42 | 70.95 | 73.11
2025.10 | 73.98 | 75.46 | 72.31 | 73.92
2025.10 | 73.95 | 74.37 | 71.45 | 73.26
2025.10 | 73.83 | 59.81 | 68.98 | 67.54
2025.10 | 73.83 | 74.73 | 72.45 | 73.67
2025.10 | 73.74 | 74.37 | 72.24 | 73.45
2025.10 | 73.72 | 74.25 | 69.8 | 72.59
2025.10 | 73.72 | 76.09 | 72.06 | 73.96
2025.10 | 73.68 | 76.25 | 71.85 | 73.93
2025.10 | 73.54 | 75.94 | 71.58 | 73.68
2025.10 | 73.51 | 74.37 | 71.41 | 73.1
2025.10 | 73.5 | 75.93 | 74.49 | 74.64
2025.10 | 73.35 | 74.15 | 72.22 | 73.24
2025.10 | 73.34 | 76.13 | 71.58 | 73.68
2025.10 | 73.32 | 74.52 | 71.3 | 73.05
2025.10 | 73.29 | 72.85 | 71.42 | 72.52
2025.10 | 73.24 | 74.91 | 71.27 | 73.14
2025.10 | 73.15 | 75.95 | 71.65 | 73.58
2025.10 | 73.11 | 74.7 | 72.87 | 73.56
2025.10 | 73 | 74.76 | 72.03 | 73.26
2025.10 | 72.91 | 72.05 | 70.14 | 71.7
2025.10 | 72.75 | 74.71 | 72.06 | 73.17
2025.10 | 72.59 | 74.56 | 71.41 | 72.86
2025.10 | 71.91 | 74.7 | 72.03 | 72.88
2025.10 | 71.89 | 74.44 | 70.8 | 72.37
2025.10 | 71.74 | 72.11 | 70.18 | 71.34
2025.10 | 71.17 | 72.92 | 71.14 | 71.74
2025.10 | 71.01 | 74.08 | 74 | 73.03
2025.10 | 70.63 | 73.87 | 70.91 | 71.8
2025.10 | 70.6 | 73.56 | 70.61 | 71.59
2025.10 | 70.54 | 74.42 | 70.78 | 71.91
2025.10 | 69.96 | 72.88 | 71.42 | 71.42
2025.10 | 69.87 | 74.32 | 70.61 | 71.6
2025.10 | 68.04 | 72.82 | 69.75 | 70.2
2025.10 | 67.8 | 71.92 | 66.5 | 68.74
2025.10 | 67.76 | 67.87 | 69.12 | 68.25
2025.10 | 42.08 | 45.61 | 37.02 | 41.57
2025.10 | 41.81 | 45.75 | 36.87 | 41.47
2025.10 | 41.68 | 46.73 | 34.6 | 41
2025.10 | 41.66 | 45.33 | 37.59 | 41.52
2025.10 | 41.55 | 46.63 | 37.45 | 41.88
2025.10 | 41.35 | 46.24 | 37.99 | 41.86
2025.10 | 41 | 46.2 | 37.88 | 41.69
2025.10 | 40.87 | 46.69 | 37.98 | 41.85
2025.10 | 40.83 | 45.84 | 36.66 | 41.11
2025.10 | 40.76 | 46.65 | 34.73 | 40.71
2025.10 | 40.75 | 45.05 | 36.22 | 40.67
2025.10 | 40.5 | 46.15 | 37.1 | 41.25
2025.10 | 40.38 | 46.87 | 36.04 | 41.1
2025.10 | 40.37 | 46.42 | 34.35 | 40.38
2025.10 | 40.33 | 42.8 | 35.08 | 39.4
2025.10 | 40.26 | 43.81 | 32.09 | 38.72
2025.10 | 40.26 | 45.17 | 37.94 | 41.13
2025.10 | 40.25 | 43.45 | 36.6 | 40.1
2025.10 | 40.15 | 46.28 | 40.65 | 42.36
2025.10 | 40.09 | 42.74 | 32.26 | 38.36
2025.10 | 39.8 | 44.12 | 39.5 | 41.14
2025.10 | 38.83 | 43.48 | 35.48 | 39.26
2025.10 | 38.52 | 7.77 | 29.88 | 25.39
2025.10 | 38.52 | 44.67 | 40.09 | 41.09
2025.10 | 38.4 | 42.49 | 38.72 | 39.87
2025.10 | 38.1 | 44.31 | 33.7 | 38.7
2025.10 | 37.94 | 45.64 | 33.33 | 38.97
2025.10 | 37.76 | 39.13 | 35.31 | 37.4
2025.10 | 37.69 | 44.44 | 33.57 | 38.57
2025.10 | 37.6 | 44.08 | 39.83 | 40.51
2025.10 | 37.43 | 44.6 | 34.65 | 38.89
2025.10 | 37.39 | 43.4 | 36.64 | 39.14
2025.10 | 37.04 | 43.24 | 34.45 | 38.24
2025.10 | 37.04 | 42.9 | 34.46 | 38.13
2025.10 | 36.85 | 37.56 | 28.02 | 34.14
2025.10 | 36.51 | 43.4 | 34.07 | 37.99
2025.10 | 36.44 | 41.72 | 31.73 | 36.63
2025.10 | 36.35 | 44.14 | 37.81 | 39.43
2025.10 | 36.32 | 43.13 | 34.12 | 37.86
2025.10 | 36.27 | 43.04 | 33.72 | 37.68
2025.10 | 36.24 | 44.05 | 38.34 | 39.55
2025.10 | 35.93 | 43.13 | 33.67 | 37.58
2025.10 | 35.69 | 43.33 | 32.95 | 37.32
2025.10 | 35.57 | 42.86 | 35.49 | 37.97
2025.10 | 35.45 | 40.85 | 35.71 | 37.34
2025.10 | 35.42 | 39.85 | 32.08 | 35.78
2025.10 | 34.66 | 37.6 | 27.97 | 33.41
2025.10 | 34.61 | 41.27 | 36.15 | 37.34
2025.10 | 34.12 | 40.53 | 36.15 | 36.93
2025.10 | 34.04 | 42.2 | 30.59 | 35.61
2025.10 | 33.46 | 39.67 | 32.65 | 35.26
2025.10 | 33.3 | 44.03 | 35.66 | 37.67
Showing 100 of 224 rows