
Multimodal Medical Question Answering on PMC v1.0 (test)

Accuracy: 0.5855

GPT-4o

[Chart: accuracy over time, with a data point dated Aug 4, 2025]
Updated 1mo ago

Evaluation Results

Date      Accuracy
2025.08   0.5855
2025.08   0.5437
2025.08   0.5339
2025.08   0.5328
2025.08   0.5254
2025.08   0.5205
2025.08   0.519
2025.08   0.5067
2025.08   0.493
2025.08   0.4732
2025.08   0.4477
2025.08   0.4442
2025.08   0.4273
2025.08   0.3675
2025.08   0.3428