Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
NLI on MultiMed-X KO
Loading...
0.6733
Accuracy
MED-COREASONER
0.597068
0.616859
0.63665
0.656441
Jan 13, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
MED-COREASONER
backbone=GPT-5.1
2026.01
0.6733
GPT-5.2
2026.01
0.6467
GPT-5.1
2026.01
0.6267
GPT-4o
2026.01
0.6133
Claude-3.5-haiku
2026.01
0.6
Feedback
Search any
task
Search any
task