Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Helpfulness on Template T3 GPT-4 evaluation (test)

91.62Win Rate

SafeDPO

44.8256.9769.1281.27May 26, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.05
91.621.127.25
2025.05
67.58.7523.75
2025.05
65.8816.7517.38
2025.05
64.25287.75
2025.05
46.6235.2518.12