Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reasoning and Persona Consistency on OPeRA (test)
Loading...
5.3
Pages per Session
Human (OPeRA)
2.284
3.067
3.85
4.633
Jan 2, 2026
Pages per Session
Thought-Action Consistency
Persona-Behavior Consistency
Purchase Rate Gap
Updated 4d ago
Evaluation Results
Method
Method
Links
Pages per Session
Thought-Action Consistency
Persona-Behavior Consistency
Purchase Rate Gap
Human (OPeRA)
2026.01
5.3
-
-
-
ALIGNUSER
counterfactual reflect...
2026.01
5.1
86.7
82.4
2.5
ALIGNUSER+
counterfactual reflect...
2026.01
5.1
89.3
85.6
2.1
SimUSER
2026.01
4.6
64.3
61.5
9.9
Agent4Rec
2026.01
4
55.8
52.4
12.1
RecAgent
2026.01
3.5
49.5
46.7
16.3
Random
2026.01
2.4
38.7
36.1
22.8
Feedback
Search any
task
Search any
task