Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Human Preference Optimization on REFL (300 random images)

1.9088ImageReward

Gradient Ascent

0.0840160.5577581.03151.505242Sep 30, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
1.90880.22470.28775.57750.14740.9195
1.89140.25260.29046.10880.17170.9242
1.70530.23620.27275.63310.29270.8401
1.59950.22260.23565.49510.55030.7225
1.59880.23230.2655.82760.28750.8505
2025.09
0.15420.23850.28876.051601