Binary comparison for commonsense plausibility on ViComTe Material 1.0 (test)
Best accuracy: 91.27 (Mistral), Feb 19, 2025
Evaluation Results
Method     Details                      Date      Accuracy
Mistral    Framework=ComPaSS, Tra...    2025.02   91.27
EVA-CLIP   Framework=ComPaSS            2025.02   90.79
Qwen2      Framework=ComPaSS, Tra...    2025.02   90.42
GPT-3.5    -                            2025.02   89.6
GPT-4      -                            2025.02   88.76