Share your thoughts, 1 month free Claude Pro on usSee more

Large Multi-modal Model Evaluation on LLaVA-Bench In-the-Wild v1

65.5Conversational Score

LLaVA-Plus

Updated 5mo ago

Evaluation Results

Method	Links
LLaVA-Plus 2023.11		65.5	56.8	79.1	69.5
LLaVA-Plus 2023.11		45.2	50.4	72.6	59.1
LLaVA 2023.11		42.6	51.9	68.9	57.1
LLaVA 2023.11		40.7	48.1	51.2	47.5
LLaVA-Plus 2023.11		38.8	39.8	59.8	48.7
GPT4Tools 2023.11		31.1	27.1	54.1	40.7