Share your thoughts, 1 month free Claude Pro on usSee more

Natural Language Inference on MultiMed-X ZH

0.7667Accuracy

MED-COREASONER

Updated 5mo ago

Evaluation Results

Method	Links
MED-COREASONER 2026.01		0.7667
GPT-5.1 2026.01		0.7467
GPT-4o 2026.01		0.7467
GPT-5.2 2026.01		0.74
Claude-3.5-haiku 2026.01		0.6933