Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multiple-choice Video Question Answering on TVQA (test)
Loading...
57.8
Accuracy
GPT-4V
35.024
40.937
46.85
52.763
Mar 27, 2024
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4V
Vision Encoder=Unknown...
2024.03
57.8
LLaVA v1.6
Vision Encoder=ViT-L,...
2024.03
51.1
IG-VLM LLaVA v1.6
Vision Encoder=ViT-L,...
2024.03
44.5
LLaVA v1.6
Vision Encoder=ViT-L,...
2024.03
42.1
VideoChat2
Vision Encoder=UMT-L,...
2024.03
40.6
CogAgent
Vision Encoder=CLIP-E,...
2024.03
38.6
Sevilla
Vision Encoder=ViT-L,...
2024.03
38.2
InternVideo
Vision Encoder=ViT-L,...
2024.03
35.9
Feedback
Search any
task
Search any
task