Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rubric Discovery on Arena-Expert-5K, HelpSteer3, HH-RLHF, UltraFeedback cross-source mean

4.45St. Score

PREMISE (+ VFR unconstrained)

2.02682.65593.2853.9141May 29, 2026
Updated 2d ago

Evaluation Results

MethodLinks
4.4548.831.358.529.955.20.091464.866.489.9
2026.05
4.435.834.659.243.975.30.01768.169.587.8
4.3951.927.460.93657.70.091464.96790.9
4.2153.126.359.746.457.70.08156566.591.9
2026.05
3.9431.913.936.529.730.021068.763.790.8
3.8349.924.154.843.2340.081168.668.591.6
2026.05
3.4455.837.854.451.7630.05467.561.493.2
2026.05
3.3858.228.170.23382.40.06368.266.587.3
2026.05
2.1251.325.958.550.240.30.041868.265.788.6