Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dimensional Aspect Sentiment Regression on JPN Hotel
Loading...
0.7188
RMSE (VA)
GPT-OSS 120B
0.621928
1.275814
1.9297
2.583586
Mar 5, 2026
RMSE (VA)
Updated 1mo ago
Evaluation Results
Method
Method
Links
RMSE (VA)
GPT-OSS 120B
Evaluation Protocol=Su...
2026.03
0.7188
Kimi K2 Thinking
Evaluation Protocol=On...
2026.03
1.7553
GPT-5 mini
Evaluation Protocol=On...
2026.03
2.1607
Qwen 3 14B
Evaluation Protocol=Su...
2026.03
2.2906
Mistral 3 14B
Evaluation Protocol=Su...
2026.03
2.2999
Kimi K2 Thinking
Evaluation Protocol=Ze...
2026.03
2.3294
Llama 3.3 70B
Evaluation Protocol=Su...
2026.03
2.6255
GPT-5 mini
Evaluation Protocol=Ze...
2026.03
3.1406
Feedback
Search any
task
Search any
task