MCQ Diagnostic Accuracy on Stanford Echo (test)

Accuracy: 64

MARCUS

Updated 4mo ago

Evaluation Results

Method	Date	Accuracy	(unlabeled)	McNemar p
MARCUS	2026.03	64	50	-
GPT-5	2026.03	34	22	-
Gemini 2.5 Pro	2026.03	22.9	10.4	-
MARCUS vs GPT-5	2026.03	-	-	0.007
MARCUS vs Gemini	2026.03	-	-	0.001
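
The McNemar p-values above compare two models answering the same test questions. As a rough illustration of how such a value can be computed, here is a minimal sketch of the exact (binomial) McNemar test in Python. The function names and the per-question correctness lists are hypothetical, and the page does not say whether the reported values come from the exact test or the chi-square approximation, so this is not a reproduction of the reported numbers.

```python
from math import comb


def discordant_counts(a_correct, b_correct):
    """Count questions where exactly one of the two models is correct.

    a_correct, b_correct: equal-length sequences of booleans, one entry
    per test question (hypothetical inputs, not data from this page).
    Returns (b, c): b = only model A correct, c = only model B correct.
    """
    if len(a_correct) != len(b_correct):
        raise ValueError("paired inputs must have the same length")
    b = sum(1 for x, y in zip(a_correct, b_correct) if x and not y)
    c = sum(1 for x, y in zip(a_correct, b_correct) if y and not x)
    return b, c


def mcnemar_exact(b, c):
    """Two-sided exact McNemar p-value from the discordant counts.

    Under the null hypothesis, each discordant pair is equally likely
    to favor either model, so min(b, c) follows Binomial(b + c, 0.5).
    """
    n = b + c
    k = min(b, c)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Hypothetical example: 10 paired answers, invented for illustration.
model_a = [True, True, True, True, False, True, True, False, True, True]
model_b = [False, True, False, False, False, True, False, False, True, False]
b, c = discordant_counts(model_a, model_b)
p_value = mcnemar_exact(b, c)
```

Only the discordant pairs enter the test; questions that both models answer correctly, or both answer incorrectly, carry no information about which model is better.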