Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Binary comparison for commonsense plausibility on CoDa 1.0 (test)
Loading...
95.39
Accuracy
EVA-CLIP
93.9964
94.3582
94.72
95.0818
Feb 19, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
EVA-CLIP
Framework=ComPaSS
2025.02
95.39
Mistral
Framework=ComPaSS, Tra...
2025.02
94.97
Qwen2
Framework=ComPaSS, Tra...
2025.02
94.71
GPT-4
2025.02
94.63
GPT-3.5
2025.02
94.05
Feedback
Search any
task
Search any
task