Visual Instruction Following

Benchmarks

Dataset Name	SOTA Method	Metric
LLaVA-Bench Wild	AutoV	Score102.3	71	2mo ago
FronTalk Multi-Turn 1.0 (test)		PR Score75	32	4mo ago
LLaVA-W		Score102	28	4mo ago
FronTalk Single-Turn 1.0 (test)		PR77.5	20	4mo ago
RefCOCO	LIVE	Accuracy47.8	16	1mo ago
SUN Instructive Visual Benchmark	LIVE	Accuracy44.68	16	1mo ago
Caltech 101 Instructive Visual Benchmark	LIVE	Accuracy38.74	16	1mo ago
ImageNet Instructive Visual Benchmark	LIVE	Accuracy85	16	1mo ago
LLaVA-Bench	DPO	Overall Score79.1	15	3mo ago
Visual Instruction Total (test)	PromptEnhancer	Avg. Response Length (Words)153.04	6	4mo ago
LLaVA-Bench 100 images (test)	Nullu	Accuracy6.53	6	4mo ago
Visual Instruction Out-Of-Distribution - Hard (test)	BeautifulPrompt	Win Ratio (Human)88	5	4mo ago
Visual Instruction Out-Of-Distribution - Simple (test)	BeautifulPrompt	Human Win Ratio93	5	4mo ago
Visual Instruction Out-Of-Distribution (test)	BeautifulPrompt	Human Win Ratio90	5	4mo ago
Visual Instruction In-Distribution - Hard (test)	BeautifulPrompt	Human Win Ratio89	5	4mo ago
Visual Instruction In-Distribution - Simple (test)	BeautifulPrompt	Human Win Ratio89	5	4mo ago
Visual Instruction In-Distribution (test)	BeautifulPrompt	Human Win Ratio89	5	4mo ago

Showing 17 of 17 rows