Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Visual Question Answering on PathMMU All-tiny (test)
Loading...
77.1
Accuracy
PathChat+
40.804
50.227
59.65
69.073
Jun 26, 2025
Accuracy
Accuracy 95% CI Lower Bound
Updated 20d ago
Evaluation Results
Method
Method
Links
Accuracy
Accuracy 95% CI Lower Bound
PathChat+
2025.06
77.1
74.7
GPT-5
2025.06
71.9
69.2
Expert performance
2025.06
71.8
-
Gemini 2.5 pro
2025.06
71.4
69
GPT-5-mini
2025.06
68.3
65.6
PathChat 1
2025.06
64
61.3
Claude Sonnet 4
2025.06
61
58.1
HuatuoGPT-Vision
2025.06
58.7
56.1
Qwen3-VL
2025.06
57.1
54.3
Llama-3.2
2025.06
53
50.1
LLaVA-OneVision
2025.06
48.9
45.9
Quilt-LLaVA
2025.06
44.1
41.3
PA-LLaVA
2025.06
42.2
39.6
Feedback
Search any
task
Search any
task