Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Chinese Reasoning on C3
Loading...
96.88
Accuracy
Llama-3.3-70B-Instruct
22.624
41.902
61.18
80.458
Apr 30, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Llama-3.3-70B-Instruct
Shots=3
2026.04
96.88
Qwen3.5-9B
Shots=3
2026.04
95.51
Qwen3-14B
Shots=3
2026.04
94.74
SecGPT-14B
Shots=3
2026.04
93.21
Qwen3-8B
Shots=3
2026.04
92
XekRung-8B
Shots=3
2026.04
91.18
Llama-3.1-8B-Instruct
Shots=3
2026.04
28.38
Foundation-Sec-8B-Reasoning
Shots=3
2026.04
25.48
Feedback
Search any
task
Search any
task