Share your thoughts, 1 month free Claude Pro on usSee more

Reasoning evaluation on 109-sample (test)

97.25Accuracy

Universe Routing

Updated 4mo ago

Evaluation Results

Method	Links
Universe Routing 2026.03		97.25	16	1	-
Qwen3-Next 2026.03		96.33	3,640	228	1
GLM-4.7 2026.03		95.28	12,392	775	0.131
Kimi-K2.5 2026.03		94.5	9,875	617	0.371
Cogito-2.1 2026.03		94.44	1,413	88	0.289
GPT-OSS 2026.03		91.74	1,986	124	0.077
DeepSeek-v3.1 2026.03		87.96	4,090	256	0.01