Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Music Recommendation on Music streaming platform A/B test successive deployments (online)
Loading...
2.26
Total Listening Time
ARGUS
0.2424
0.7662
1.29
1.8138
Jul 21, 2025
Total Listening Time
Like Likelihood
Updated 4d ago
Evaluation Results
Method
Method
Links
Total Listening Time
Like Likelihood
ARGUS
Context Length=8192, E...
2025.07
2.26
6.37
Offline V2
Context Length=1024, E...
2025.07
1
0.73
Offline V3
Context Length=1024, E...
2025.07
0.73
5
Offline V1
Context Length=512, En...
2025.07
0.52
1.11
Real-time V1
Context Length=1024, E...
2025.07
0.32
1.38
Feedback
Search any
task
Search any
task