Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Instruction Tuning on Anthropic HH-RLHF (test)

-Average Reward Score

No plottable results for Average Reward Score (SCALAR).
Updated 4d ago

Evaluation Results

MethodLinks
No evaluation results found.