Visual-CoT

Benchmarks

Task Name	Dataset Name	SOTA Result	Trend
Multimodal Vision-Language Reasoning	Visual CoT benchmark	DocV Score: 83.3		13
Visual Grounding	Visual-CoT	Error Rate: 22.3		6
