Share your thoughts, 1 month free Claude Pro on usSee more

Differences Caption Generation on MagicBrush (test)

50Main Difference Count

GPT-4o

Updated 5mo ago

Evaluation Results

Method	Links
GPT-4o 2025.06		50	11	12	61	57	1.9	0
Qwen2.5 VL 2025.06		37	11	1	63	59	1.5	60
GPT-4 2025.06		34	8	8	75	74	2.5	80
GPT-4 Turbo 2025.06		34	7	8	78	75	1.5	450
InternVL3 2025.06		25	8	11	88	83	3.4	230
LLaVA 2025.06		12	-	-	-	-	-	-
LLaVA 2025.06		5	-	-	-	-	-	-