Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Human Preference Evaluation

Benchmarks

Task NameDataset NameSOTA ResultTrend
Image GenerationHuman Preference Evaluation 55 prompts
Votes500
6
Human Preference EvaluationHuman Preference Evaluation 371 prompts (test)
Recall @139.89
3
Human Preference EvaluationHuman Preference Evaluation 466 prompts (test)
Preference Accuracy65.14
3
Showing 3 of 3 rows