Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Conversational Recommendation on MUSE Multimodal Fashion (test)
Loading...
10.2
R@1
HARPO
1.256
3.578
5.9
8.222
Apr 11, 2026
R@1
R@10
R@50
MRR@10
NDCG@10
User Satisfaction
Engagement
Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
R@1
R@10
R@50
MRR@10
NDCG@10
User Satisfaction
Engagement
Average Score
HARPO
2026.04
10.2
38.6
58.4
19.8
26.4
72
68
52.4
Qwen2-VL-7B
fine-tuning_protocol=W...
2026.04
8.4
34.2
52.8
17.2
23.1
61
57
46.8
LLaVA-Next-8B
fine-tuning_protocol=W...
2026.04
5.2
25.4
44.2
12
16.2
52
48
38
GPT-4V
2026.04
4.4
23.2
42.6
10.8
14.8
54
50
37.4
UniCRS
adaptation_type=text-only
2026.04
1.6
11.8
27.4
5.1
7.2
36
32
22.9
Feedback
Search any
task
Search any
task