Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Alignment on HH-RLHF 300 prompts

69.8Win/Tie Rate vs Vanilla (GPT-4o)

CARDS

49.20854.55459.965.246Nov 5, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.11
69.8
2025.11
64.5
2025.11
64.5
2025.11
60.5
2025.11
60.2
2025.11
59.2
2025.11
59
2025.11
58.8
2025.11
56.4
2025.11
55.2
2025.11
55
2025.11
54.8
2025.11
50.4
2025.11
50.2
2025.11
50
2025.11
50