Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Human Preference Alignment Out-of-Domain (test)
Loading...
35.3
HPS-v2.1
TAFS-GRPO
27.708
29.679
31.65
33.621
Feb 2, 2026
HPS-v2.1
ImageReward
Unified Reward
Updated 4d ago
Evaluation Results
Method
Method
Links
HPS-v2.1
ImageReward
Unified Reward
TAFS-GRPO
NFE_pi_theta_old / NFE...
2026.02
35.3
159.5
3.511
DanceGRPO
NFE_pi_theta_old / NFE...
2026.02
33.3
121.2
3.484
MixGRPO
NFE_pi_theta_old / NFE...
2026.02
32.4
121
3.472
Flow-GRPO
NFE_pi_theta_old / NFE...
2026.02
30.4
103.5
3.46
Reward-Instruct
Iteration Time (s)=206
2026.02
28.6
97.3
3.392
RG-LCD
Iteration Time (s)=466
2026.02
28.3
92.9
3.336
Flux.1-dev
2026.02
28
84.8
3.328
Feedback
Search any
task
Search any
task