Chinese Reasoning on OCNLI
Best result: Qwen3.5-9B, 74.4 Accuracy (Apr 30, 2026)
[Chart: Accuracy over time]
Evaluation Results
Method                         Setup     Date      Accuracy
Qwen3.5-9B                     Shots=5   2026.04   74.4
Llama-3.3-70B-Instruct         Shots=5   2026.04   71.59
XekRung-8B                     Shots=5   2026.04   69.96
Qwen3-14B                      Shots=5   2026.04   69.72
Qwen3-8B                       Shots=5   2026.04   68.81
SecGPT-14B                     Shots=5   2026.04   67.66
Llama-3.1-8B-Instruct          Shots=5   2026.04   43.81
Llama-Primus-Reasoning-8B      Shots=5   2026.04   30.6
Foundation-Sec-8B-Reasoning    Shots=5   2026.04   18.33