Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Recommendation Explanation Evaluation on Human Expert Evaluation Set Aggregated (test)
Loading...
4.38
Helpfulness Score
MATRAG
3.0696
3.4098
3.75
4.0902
Feb 11, 2026
Helpfulness Score
Trustworthiness Score
Informativeness Score
Personalization Score
Average Evaluation Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Helpfulness Score
Trustworthiness Score
Informativeness Score
Personalization Score
Average Evaluation Score
MATRAG
2026.02
4.38
4.31
4.52
4.24
4.36
G-CRS
2026.02
3.91
3.82
4.08
3.67
3.87
K-RagRec
2026.02
3.78
3.67
3.92
3.51
3.72
MACRec
2026.02
3.56
3.42
3.71
3.28
3.49
Chat-Rec
2026.02
3.12
2.98
3.34
2.87
3.08
Feedback
Search any
task
Search any
task