Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Image Evaluation on Airbnb Study
Loading...
5.263
Realism Score
Utility-Aware Generator
3.1882
3.72685
4.2655
4.80415
May 27, 2026
Realism Score
Uniqueness Score
Aesthetic Score
Booking Potential Score
Updated 6d ago
Evaluation Results
Method
Method
Links
Realism Score
Uniqueness Score
Aesthetic Score
Booking Potential Score
Utility-Aware Generator
Rating Scale=7-point
2026.05
5.263
4.671
5.263
5.039
Flux
Rating Scale=7-point
2026.05
5.145
4.627
4.964
4.807
OpenAI
Rating Scale=7-point
2026.05
4.198
4.527
4.989
4.582
Stable Diffusion
Rating Scale=7-point
2026.05
3.268
3.381
3.66
3.423
Feedback
Search any
task
Search any
task