Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Binary comparison for commonsense plausibility on ViComTe Color 1.0 (test)
Loading...
93.29
Accuracy
GPT-4
85.7708
87.7229
89.675
91.6271
Feb 19, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4
2025.02
93.29
EVA-CLIP
Framework=ComPaSS
2025.02
93.29
GPT-3.5
2025.02
92.25
Qwen2
Framework=ComPaSS, Tra...
2025.02
86.79
Mistral
Framework=ComPaSS, Tra...
2025.02
86.06
Feedback
Search any
task
Search any
task