Chinese Language Understanding on C-Eval (EM)
Best Exact Match: 91.8 (OpenAI-o1-1217), Jan 22, 2025
Updated 4d ago
Evaluation Results
| Method | Details | Date | Exact Match |
| --- | --- | --- | --- |
| OpenAI-o1-1217 | | 2025.01 | 91.8 |
| DeepSeek-R1 | Architecture=MoE, Acti... | 2025.01 | 91.8 |
| DeepSeek-V3 | Architecture=MoE, Acti... | 2025.01 | 86.5 |
| Claude-3.5-Sonnet-1022 | | 2025.01 | 76.7 |
| GPT-4o-0513 | | 2025.01 | 76 |
| OpenAI-o1-mini | | 2025.01 | 68.9 |