Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dimensional Aspect Sentiment Regression on TAT Restaurant
Loading...
1.7153
RMSE (VA)
GPT-OSS 120B
1.66206
2.02143
2.3808
2.74017
Mar 5, 2026
RMSE (VA)
Updated 1mo ago
Evaluation Results
Method
Method
Links
RMSE (VA)
GPT-OSS 120B
Evaluation Protocol=Su...
2026.03
1.7153
Kimi K2 Thinking
Evaluation Protocol=On...
2026.03
1.938
GPT-5 mini
Evaluation Protocol=On...
2026.03
2.2308
Kimi K2 Thinking
Evaluation Protocol=Ze...
2026.03
2.3636
Qwen 3 14B
Evaluation Protocol=Su...
2026.03
2.6367
GPT-5 mini
Evaluation Protocol=Ze...
2026.03
2.6645
Llama 3.3 70B
Evaluation Protocol=Su...
2026.03
2.9165
Mistral 3 14B
Evaluation Protocol=Su...
2026.03
3.0463
Feedback
Search any
task
Search any
task