Share your thoughts, 1 month free Claude Pro on usSee more

Natural Language Inference on MultiMed-X EN

78.67Accuracy

GPT-4o

Updated 5mo ago

Evaluation Results

Method	Links
GPT-4o 2026.01		78.67
MED-COREASONER 2026.01		77.33
GPT-5.2 2026.01		76.67
GPT-5.1 2026.01		76.67
Claude-3.5-haiku 2026.01		69.33