
Prompt Optimization Evaluation on HelpSteer2

Top Helpfulness: 0.5072 (MA-SAPO)

[Chart: Helpfulness score; Oct 18, 2025]
Updated 17d ago

Evaluation Results

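| Method | Date | Helpfulness | Correctness | Coherence | Complexity | Verbosity | Average |
|---|---|---|---|---|---|---|---|
| MA-SAPO | 2025.10 | 0.5072 | 0.6038 | 0.8527 | 0.5244 | 0.757 | 0.649 |
| | 2025.10 | 0.4903 | 0.5745 | 0.8642 | 0.4161 | 0.6567 | 0.6003 |
| | 2025.10 | 0.4791 | 0.5569 | 0.8268 | 0.4095 | 0.6234 | 0.5791 |
| | 2025.10 | 0.44 | 0.5175 | 0.8221 | 0.4025 | 0.5992 | 0.5563 |
| | 2025.10 | 0.4296 | 0.5019 | 0.7994 | 0.3957 | 0.6181 | 0.5482 |
| | 2025.10 | 0.4167 | 0.5049 | 0.8067 | 0.4084 | 0.6708 | 0.5615 |
| | 2025.10 | 0.4005 | 0.4754 | 0.8294 | 0.4833 | 0.8441 | 0.6065 |
| | 2025.10 | 0.399 | 0.4711 | 0.7989 | 0.3722 | 0.5814 | 0.5245 |
| | 2025.10 | 0.3971 | 0.4532 | 0.7898 | 0.3998 | 0.6439 | 0.5368 |
| | 2025.10 | 0.3616 | 0.47 | 0.7723 | 0.328 | 0.4913 | 0.4846 |
| | 2025.10 | 0.2981 | 0.3958 | 0.7169 | 0.2888 | 0.4709 | 0.4341 |
| | 2025.10 | 0.2616 | 0.3636 | 0.7078 | 0.2942 | 0.4421 | 0.4139 |
| | 2025.10 | 0.2555 | 0.3303 | 0.705 | 0.3267 | 0.4441 | 0.4123 |
| | 2025.10 | 0.16 | 0.2545 | 0.6291 | 0.2421 | 0.3812 | 0.3334 |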
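
Each result row above carries six scores, and the sixth appears to be the unweighted mean of the first five. The page itself does not state this, so treat it as an assumption. A minimal Python sketch that checks it on three rows copied from the results (the helper name `mean_gap` and the `1e-3` tolerance are illustrative choices, not part of the leaderboard):

```python
# Three rows copied from the results above: five scores followed by the sixth value.
rows = [
    [0.5072, 0.6038, 0.8527, 0.5244, 0.757, 0.649],
    [0.4791, 0.5569, 0.8268, 0.4095, 0.6234, 0.5791],
    [0.16, 0.2545, 0.6291, 0.2421, 0.3812, 0.3334],
]


def mean_gap(row):
    """Absolute gap between the reported sixth value and the mean of the first five."""
    *scores, reported = row
    return abs(sum(scores) / len(scores) - reported)


# Assumed tolerance: the page rounds to four decimals, so allow a small gap.
TOLERANCE = 1e-3

gaps = [mean_gap(r) for r in rows]
for row, gap in zip(rows, gaps):
    status = "consistent" if gap < TOLERANCE else "inconsistent"
    print(f"reported={row[-1]:.4f} gap={gap:.5f} {status}")
```

A gap within the tolerance is consistent with the sixth value being a plain average; it does not rule out other aggregation schemes that happen to agree on these rows.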