Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Recommendation on LastFM (test)
Loading...
22.12
Hit@1
CoARS
0.02
5.7575
11.495
17.2325
Apr 11, 2026
Hit@1
Updated 5d ago
Evaluation Results
Method
Method
Links
Hit@1
CoARS
Backbone=Qwen3-8B
2026.04
22.12
RecoWorld
Backbone=Qwen3-8B
2026.04
19.85
CoARS
Backbone=Qwen3-4B
2026.04
18.38
iAgent
Backbone=GPT-5.4-mini
2026.04
15.83
RecoWorld
Backbone=Qwen3-4B
2026.04
12.48
iAgent
Backbone=Qwen3-8B
2026.04
6.48
iAgent
Backbone=Qwen3-4B
2026.04
4.12
AFL
Backbone=GPT-5.4-mini
2026.04
3.01
AFL
Backbone=Qwen3-8B
2026.04
2.15
Reflexion
Backbone=GPT-5.4-mini
2026.04
1.69
AFL
Backbone=Qwen3-4B
2026.04
1.51
Reflexion
Backbone=Qwen3-8B
2026.04
0.96
Reflexion
Backbone=Qwen3-4B
2026.04
0.87
Feedback
Search any
task
Search any
task