Model Alignment Evaluation on Stoic Alignment Benchmark

Mean Score: 32.24

Qwen3 Few-shot

[Chart: axis ticks 17.0664, 21.0057, 24.945, 28.8843; date label May 12, 2026]
Updated 21d ago

Evaluation Results

Method    Mean Score                                        Links
2026.05   32.24       32.42  2.17  100  0.22  31.81  -      -
2026.05   30.93       31     2.25  100  0.23  30.49  0.001  -
2026.05   29.3        29.39  2.16  100  0.22  28.88  0.001  -
2026.05   28.98       29.22  2.61  100  0.26  28.47  0.001  -
2026.05   28.54       28.78  2.21  100  0.22  28.1   0.001  -
2026.05   28.46       28.56  2.01  100  0.2   28.07  0.001  -
2026.05   28.37       28.39  2.34  100  0.23  27.92  0.001  -
2026.05   27.79       28.06  2.76  100  0.28  27.25  0.001  -
2026.05   26.11       26.78  3.58  100  0.36  25.41  0.001  -
2026.05   25.13       25.83  3.23  100  0.32  24.49  0.001  -
2026.05   21.56       22.44  4.45  100  0.45  20.69  0.001  -
2026.05   21.49       21.72  4.03  100  0.4   20.7   0.001  -
2026.05   19.82       20.39  4.34  100  0.43  18.97  0.001  -
2026.05   18.66       19.17  4.52  100  0.45  17.78  0.001  -
2026.05   18.55       18.72  4.06  100  0.41  17.76  0.001  -
2026.05   17.65       17.83  4.15  100  0.42  16.83  0.001  -