Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reward Modeling on VLRewardBench (test)

84.7General

IXC-2.5-Reward

4.125.02545.9566.875Jan 21, 2025Apr 10, 2025Jun 28, 2025Sep 15, 2025Dec 3, 2025Feb 20, 2026May 10, 2026
Updated 22d ago

Evaluation Results

MethodLinks
2025.01
84.762.562.965.870
2025.12
60.678.460.566.1-
2026.05
59.788.372.680.173.5
2026.05
55.886.148.37263.4
2025.12
55.77264.867.8-
2026.05
55.387.765.977.569.6
2025.01
54.638.359.141.244
2026.05
54.638.359.144.545.2
2026.05
51.987.150.873.263.3
2025.01
50.872.564.267.262.5
2025.01
49.167.670.565.862.4
2025.12
49.167.670.565.8-
2025.12
48.151.348.150-
2025.01
47.859.658.457.655.3
2025.12
47.551.351.950.9-
2025.12
4771.364.566-
2026.05
4772.443.261.354.2
2026.05
46.464.93654.949.1
2026.05
4550.557.650.251
2025.01
43.45562.355.353.6
2025.01
42.657.361.756.253.9
2026.05
42.657.361.756.253.9
2025.01
41.734.558.241.544.8
2025.01
38.931.66240.144.1
2026.05
38.931.66240.144.1
2025.01
38.132.85839.543
2025.12
37.748.760.450.1-
2025.01
35.641.15944.545.2
2025.01
35.625.959.935.840.4
2026.05
35.641.15944.545.2
2025.01
33.942.354.944.143.7
2025.01
33.338.456.642.942.8
2026.05
33.338.456.642.942.8
2025.01
32.220.157.129.636.5
2025.01
31.619.151.128.333.9
2025.01
31.131.856.237.539.7
2026.05
31.131.856.237.539.7
2026.05
18.68.922.116.516.5
2026.05
7.227.118.61917.6