Share your thoughts, 1 month free Claude Pro on usSee more

Artifact Explanation on ArtiBench (test)

23.3ROUGE

Qwen2.5-VL-7B + ArtiAgent

Updated 4mo ago

Evaluation Results

Method	Links
Qwen2.5-VL-7B + ArtiAgent 2026.02		23.3	64.3
InternVL3.5-8B + ArtiAgent 2026.02		22.6	62.5
Gemini-2.5-Pro 2026.02		15.9	42
GPT-5 2026.02		14.5	43.4
LEGION 2026.02		14.3	33.2
GPT-4o 2026.02		14.3	43.3
InternVL3.5-8B 2026.02		12.6	25.6
Qwen2.5-VL-7B 2026.02		11.7	26.3