Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Preference Classification on Anthropic HH Helpful (test)

57.6Accuracy

UMM-RM

44.0847.5951.154.61Nov 30, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.11
57.6
2025.11
55.2
2025.11
55
2025.11
54.8
2025.11
54.6
2025.11
54.2
2025.11
44.6