Visual Question Answering on PathMMU All-tiny (test)

77.1 Accuracy

PathChat+

Updated 3 months ago

Evaluation Results

Method	Date	Accuracy	(unlabeled)
PathChat+	2025.06	77.1	74.7
GPT-5	2025.06	71.9	69.2
Expert performance	2025.06	71.8	-
Gemini 2.5 pro	2025.06	71.4	69.0
GPT-5-mini	2025.06	68.3	65.6
PathChat 1	2025.06	64.0	61.3
Claude Sonnet 4	2025.06	61.0	58.1
HuatuoGPT-Vision	2025.06	58.7	56.1
Qwen3-VL	2025.06	57.1	54.3
Llama-3.2	2025.06	53.0	50.1
LLaVA-OneVision	2025.06	48.9	45.9
Quilt-LLaVA	2025.06	44.1	41.3
PA-LLaVA	2025.06	42.2	39.6