Instruction Following on DomainBench

Agriculture Score: 21.85

SYTTA

-0.46845.325811.1216.9142
Oct 11, 2025
Updated 23d ago

Evaluation Results

Date      Agriculture   Score 2   Score 3   Score 4   Average
2025.10   21.85         30.93     22.26     29.57     26.15
2025.10   21.14         28.81     26.83     30.25     26.76
2025.10   20.17         29.47     26.48     29.58     26.43
2025.10   20.12         29.45     17.59     29.07     24.06
2025.10   20.09         30.57     31.05     30.14     27.96
2025.10   19.72         26.74     17.64     29.1      23.3
2025.10   19.64         22.15     5.31      28.59     18.92
2025.10   19.56         26.52     25.03     29.55     25.16
2025.10   19.52         30.45     28.91     29.53     27.1
2025.10   19.4          29.37     29.56     29.67     27
2025.10   18.82         28.72     29.79     29.95     26.82
2025.10   18.37         27.15     17.85     28.18     22.89
2025.10   18.31         29.47     25.92     29.79     25.87
2025.10   17.68         29.42     29.74     29.69     26.63
2025.10   16.49         29.52     24.82     29.5      25.08
2025.10   16.33         28.85     25.71     28.95     24.96
2025.10   16.3          21.24     12.83     22.57     18.23
2025.10   15.38         28.31     19.66     28.28     22.91
2025.10   15.17         29.19     21.23     29.86     23.86
2025.10   14.23         27.56     24.29     26.73     23.2
2025.10   11.23         26.21     29.67     28.13     23.81
2025.10   11.09         28.7      32.2      29.48     25.37
2025.10   10.67         23.46     14.42     24.36     18.23
2025.10   9.43          22.03     12.51     23.88     16.96
2025.10   8.59          22.28     13.53     21.65     16.51
2025.10   8.57          15.85     14.18     13.21     12.95
2025.10   8.38          16.36     9.43      12.58     11.69
2025.10   8.34          22.02     14.13     21.45     16.48
2025.10   8.19          15.37     7.19      12.01     10.69
2025.10   8.1           15.62     13.61     12.79     12.53
2025.10   8.05          15.28     11.38     13.02     11.93
2025.10   7.71          15.83     14.1      12.64     12.57
2025.10   7.68          16.25     14.79     12.01     12.68
2025.10   7.52          15.79     13.29     12.04     12.16
2025.10   7.32          15.15     4.24      11.96     9.67
2025.10   7.29          15.54     14.01     12.21     12.26
2025.10   7.26          15.31     13.22     12.48     12.07
2025.10   7.26          11.82     9.36      10.75     9.8
2025.10   7.11          14.83     3.76      11.92     9.4
2025.10   7.07          15.05     9.21      11.91     10.81
2025.10   7.05          15.82     11.14     12.17     11.54
2025.10   6.98          15.31     14.98     12.39     12.42
2025.10   6.86          15.84     10.86     12.54     11.53
2025.10   6.6           15.18     6.66      11.57     10
2025.10   6.57          13.76     4.12      11.95     9.1
2025.10   6.43          15.17     9.61      11.89     10.78
2025.10   6.41          16.47     10.99     12.27     11.54
2025.10   6.38          16.53     14.76     12.35     12.5
2025.10   6.25          9.49      3.64      10.63     7.5
2025.10   6.09          15.43     10.92     11.91     11.09
2025.10   6.08          15.64     7.33      12.11     10.29
2025.10   6.07          15.85     13.67     12.04     11.91
2025.10   6.03          14.41     7.37      11.24     9.76
2025.10   6.02          16.22     9.65      12.51     11.1
2025.10   5.73          13.89     4.19      11.62     8.86
2025.10   5.73          16.05     9.03      11.99     10.7
2025.10   5.65          14.61     7.23      11.71     9.8
2025.10   5.64          14.21     4.42      11.2      8.87
2025.10   5.46          15.78     14.06     11.45     11.69
2025.10   5.37          14.89     8.49      10.58     9.83
2025.10   5.32          15.09     4.1       11.6      9.03
2025.10   5.31          15.87     14.27     11.83     11.82
2025.10   5.07          13.39     3.62      10.63     8.18
2025.10   4.92          27.89     14.87     28.19     18.97
2025.10   4.54          14.71     4.64      10.31     8.55
2025.10   4.31          14.4      16.75     12.07     11.88
2025.10   4.2           13.23     14.85     11.79     11.02
2025.10   4.14          10.1      3.84      8.69      6.69
2025.10   3.63          9.5       3.5       8.67      6.33
2025.10   3.32          9.97      3.28      7.48      6.01
2025.10   3.32          12.33     2.22      2.94      5.2
2025.10   3.11          12.96     3.18      7.21      6.61
2025.10   3.05          8.21      2.77      5.17      4.8
2025.10   3.04          9.81      2.7       7.42      5.74
2025.10   2.62          11.82     2.62      8.51      6.39
2025.10   1.88          28.23     3.17      27.97     15.31
2025.10   1.52          6.43      1.86      14.6      6.1
2025.10   1.16          3.79      0.74      13.22     4.73
2025.10   0.98          4.59      9.32      2.33      4.3
2025.10   0.39          4.89      5.49      0.03      2.7