Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Item Ranking on RecRM-Bench
Loading...
86.78
Accuracy
Ours
27.2608
42.7129
58.165
73.6171
May 12, 2026
Accuracy
AUC
HR
Updated 21d ago
Evaluation Results
Method
Method
Links
Accuracy
AUC
HR
Ours
2026.05
86.78
86.32
83.67
LongCat-Flash-Chat
Thinking=false
2026.05
62.48
54.93
82.93
Deepseek-V3.2
Thinking=false
2026.05
52.32
50.57
82.53
Qwen3-Max
Thinking=true
2026.05
50.16
50.01
78.42
Qwen3-Max
Thinking=false
2026.05
50.11
50.18
77.17
GPT-4.1
2026.05
42.54
51.32
75.37
LongCat-Flash-Thinking
Thinking=true
2026.05
34.09
57.89
70
Deepseek-V3.2
Thinking=true
2026.05
29.55
57.04
72.42
Feedback
Search any
task
Search any
task