Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Personalized Response Generation on RPEVAL
Loading...
24
Macro Accuracy
RP-Reasoner
0.392
6.521
12.65
18.779
Jan 23, 2026
Macro Accuracy
Micro Accuracy
Judge (Error Severity)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro Accuracy
Micro Accuracy
Judge (Error Severity)
RP-Reasoner
Reasoning=Rational Per...
2026.01
24
52.2
2.493
CoT
Prompting=Chain-of-Tho...
2026.01
6.7
39.5
3.487
Vanilla
Prompting=Vanilla
2026.01
1.3
40
3.533
Reminder
Prompting=Reminder
2026.01
1.3
39.5
3.58
Feedback
Search any
task
Search any
task