Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Webpage Question Answering on VisualWebBench MultiUI-WQA
Loading...
89.47
Accuracy
GPT-5-nano
68.0772
73.6311
79.185
84.7389
Oct 16, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-5-nano
Model Category=Proprie...
2025.10
89.47
COGS
Approach Type=Data Syn...
2025.10
88.04
MultiUI-WQA
Approach Type=Data Syn...
2025.10
86.6
Decompositional CoT
Approach Type=Inferenc...
2025.10
86.12
Qwen2.5-VL-7B (base model)
Model Category=Opensou...
2025.10
85.65
Gemini 2.5 Flash-Lite
Model Category=Proprie...
2025.10
81.85
GPT-4o-mini
Model Category=Proprie...
2025.10
81.34
Claude Haiku 3.5
Model Category=Proprie...
2025.10
80.86
InternVL3.5-GPT-OSS
Model Category=Opensou...
2025.10
74.64
Phi-4-14B
Model Category=Opensou...
2025.10
74.16
UiX-Qwen2
Model Category=Special...
2025.10
68.9
Feedback
Search any
task
Search any
task