Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Helpfulness on GPT-4 Evaluation Template T2 (overall)

91.6Win Rate

SafeDPO

57.571266.405675.2484.0744May 26, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.05
91.60.647.76
2025.05
85.511.4213.07
2025.05
75.9511.2712.78
2025.05
72.588.6718.75
2025.05
58.8816.7324.39