Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reward Modeling on Anthropic HH (test)

68.49Accuracy

Dahoas/gptj-rm-static

44.15450.47256.7963.108Apr 11, 2023
Updated 4d ago

Evaluation Results

MethodLinks
68.49
2023.04
61.75
2023.04
46.03
2023.04
45.13
2023.04
45.09