Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dimensional Aspect Sentiment Regression on UKR Restaurant
Loading...
1.5166
RMSE_VA
GPT-OSS 120B
1.474428
1.759089
2.04375
2.328411
Mar 5, 2026
RMSE_VA
Updated 1mo ago
Evaluation Results
Method
Method
Links
RMSE_VA
GPT-OSS 120B
Evaluation Protocol=Su...
2026.03
1.5166
Kimi K2 Thinking
Evaluation Protocol=On...
2026.03
1.7805
GPT-5 mini
Evaluation Protocol=On...
2026.03
2.0438
Kimi K2 Thinking
Evaluation Protocol=Ze...
2026.03
2.0782
Qwen 3 14B
Evaluation Protocol=Su...
2026.03
2.2121
Mistral 3 14B
Evaluation Protocol=Su...
2026.03
2.4592
GPT-5 mini
Evaluation Protocol=Ze...
2026.03
2.5628
Llama 3.3 70B
Evaluation Protocol=Su...
2026.03
2.5709
Feedback
Search any
task
Search any
task