Share your thoughts, 1 month free Claude Pro on usSee more

Multiple-choice Reasoning on GPQA (test)

65.7Accuracy

RouteGoT

Updated 4mo ago

Evaluation Results

Method	Links
RouteGoT 2026.03		65.7	3,352
AGoT 2026.03		64.6	12,179
CoT 2026.03		63.1	4,965
KNN 2026.03		61.1	3,292
Random 2026.03		60.1	5,658
GoT* 2026.03		59.6	9,468
EmbedLLM 2026.03		59.6	11,369
RouteLLM 2026.03		57.6	3,640
RTR 2026.03		56.6	3,224
ToT 2026.03		44.9	9,077
IO 2026.03		41.4	7