Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Negotiation vs GPT-5.4 High Reasoning Seller on Standard Held-Out Test Set

0.4081Reward

Qwen3-30B-A3B-Instruct-2507-trained

0.0939160.1754830.257050.338617Apr 10, 2026
Updated 5d ago

Evaluation Results

MethodLinks
0.40817540.810
0.274460.53.8932.4
2026.04
0.182391.418.230
2026.04
0.145884.816.141.6
2026.04
0.122392.613.020.8
2026.04
0.120490.614.772.7
2026.04
0.10690.612.561.2