Chinese Language Understanding on C-Eval (EM)

91.8 Exact Match

OpenAI-o1-1217

Updated 4mo ago

Evaluation Results

Method	Date	Exact Match
OpenAI-o1-1217	2025.01	91.8
DeepSeek-R1	2025.01	91.8
DeepSeek-V3	2025.01	86.5
Claude-3.5-Sonnet-1022	2025.01	76.7
GPT-4o-0513	2025.01	76.0
OpenAI-o1-mini	2025.01	68.9