Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Role-play dialogue comprehension on SocialBench

96.6Role Knowledge

Qwen3-8B + CRPO

56.45666.87877.387.722May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
96.692.248.835799792.795.585.280.2
2026.05
96.692.248.835799792.795.585.280.2
2026.05
95.189.645.2287294.391.394.68276.9
2026.05
95.189.645.2287294.391.394.68276.9
2026.05
94.988.444.9287694.191.994.285.477.5
2026.05
94.988.444.9287694.191.994.285.477.5
2026.05
94.989.645.2238591.491.995.583.177.7
2026.05
94.688.151.1238793.692.793.48278.4
2026.05
94.389.544.72881919195.588.178.1
2026.05
94.287.648.9288691.591.393.882.678.2
2026.05
94.290.346.3286893.491.694.683.176.6
2026.05
94.287.648.9288691.591.393.882.678.2
2026.05
94.289.245.5287792.692.796.383.177.6
2026.05
94.290.346.3286893.491.694.683.176.6
2026.05
93.888.845.7237491.892.695.58776.9
2026.05
93.888.845.7237491.892.695.58776.9
2026.05
93.288.852.7288194.192.293.884.878.7
2026.05
93.288.852.7288194.192.293.884.878.7
2026.05
93.18746.225.677.288.191.693.983.176.2
2026.05
93.18746.225.677.288.191.693.983.176.2
2026.05
92.683.747.9238484.791.392.184.876
2026.05
92.480.443.526.768.878.586.692.577.571.9
2026.05
92.480.443.526.768.878.586.692.577.571.9
2026.05
91.381.448.7238185.588.189.678.874.1
2026.05
91.381.448.7238185.588.189.678.874.1
2026.05
91.183.248.52580.792.988.991.581.776
2026.05
91.183.248.52580.792.988.991.581.776
89.980.834.329.29284.284.389.974.273.2
89.980.834.329.29284.284.389.974.273.2
89.372.846.726.38748.483.790.385.770
89.372.846.726.38748.483.790.385.770
2026.05
86.875.743.468.38445.981.483.176.471.7
86.875.743.468.38445.981.483.176.471.7
2026.05
85.57141.831.77160.579.885.279.167.3
2026.05
85.570.442.642.9806976.98477.569.9
2026.05
85.270.440.328.67053.477.585.176.565.2
2026.05
85.270.140.223.3756978.586.576.967.2
2026.05
85.170.740.227.9736979.284.878.667.6
2026.05
84.772.740.526.77356.477.681.667.664.5
2026.05
84.770.542.1307564.573.18473.666.4
2026.05
84.172.240.91873558285.373.864.9
2026.05
82.262.239.445.87569.850.673.415.957.1
82.262.239.445.87569.850.673.415.957.1
2026.05
81.674.942.433.86565.775.681.965.465.1
81.674.942.433.86565.775.681.965.465.1
2026.05
79.671.838.6216962.275.882.971.363.6
2026.05
79.173.538.333.37051.681.183.563.763.8
2026.05
78.170.541.529.673.551.273.779.763.562.4
2026.05
7870.241.621.77144.771.58163.760.4
2026.05
77.964.138.952.57259.169.982.346.762.6
2026.05
77.964.138.952.57259.169.982.346.762.6
2026.05
77.670.242.430.47454.276.682.77264.5
2026.05
77.16539.829.657.644.672.881.66058.7
74.779.426.241.281.168.284.470.436.362.5
2026.05
74.773.54030.47349.476.983.566.263.1
74.779.426.241.281.168.284.470.436.362.5
74.271.52940.85743.157.772.229.152.7
74.271.52940.85743.157.772.229.152.7
2026.05
73.572.540.435.47350.476.980.566.963.3
5825.340.831.36746.460.967.148.449.4
5825.340.831.36746.460.967.148.449.4