Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dimensional Aspect Sentiment Regression on JPN Finance
Loading...
1.0188
RMSE_VA
GPT-OSS 120B
0.952512
1.399956
1.8474
2.294844
Mar 5, 2026
RMSE_VA
Updated 1mo ago
Evaluation Results
Method
Method
Links
RMSE_VA
GPT-OSS 120B
Evaluation Protocol=Su...
2026.03
1.0188
Kimi K2 Thinking
Evaluation Protocol=On...
2026.03
1.6396
Qwen 3 14B
Evaluation Protocol=Su...
2026.03
1.8964
GPT-5 mini
Evaluation Protocol=On...
2026.03
1.9243
Mistral 3 14B
Evaluation Protocol=Su...
2026.03
2.07
Kimi K2 Thinking
Evaluation Protocol=Ze...
2026.03
2.3379
Llama 3.3 70B
Evaluation Protocol=Su...
2026.03
2.4191
GPT-5 mini
Evaluation Protocol=Ze...
2026.03
2.676
Feedback
Search any
task
Search any
task