Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dimensional Aspect Sentiment Regression on RUS Restaurant
Loading...
1.4775
RMSE_VA
GPT-OSS 120B
1.434812
1.722956
2.0111
2.299244
Mar 5, 2026
RMSE_VA
Updated 1mo ago
Evaluation Results
Method
Method
Links
RMSE_VA
GPT-OSS 120B
Evaluation Protocol=Su...
2026.03
1.4775
Kimi K2 Thinking
Evaluation Protocol=On...
2026.03
1.7768
GPT-5 mini
Evaluation Protocol=On...
2026.03
2.039
Kimi K2 Thinking
Evaluation Protocol=Ze...
2026.03
2.063
Qwen 3 14B
Evaluation Protocol=Su...
2026.03
2.1528
Mistral 3 14B
Evaluation Protocol=Su...
2026.03
2.3617
Llama 3.3 70B
Evaluation Protocol=Su...
2026.03
2.5089
GPT-5 mini
Evaluation Protocol=Ze...
2026.03
2.5447
Feedback
Search any
task
Search any
task