Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLaVA-W

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal ReasoningLLaVA-W (test)
Accuracy90.6
12
Out-of-Distribution General Visual Question AnsweringLLaVA-W
Score0.755
6
Vision-to-TextLLAVA-W English
Score111.9
3
Showing 3 of 3 rows