Binary comparison for commonsense plausibility on CoDa 1.0 (test)

95.39 Accuracy

EVA-CLIP

Updated 3mo ago

Evaluation Results

Method	Links	Accuracy
EVA-CLIP 2025.02		95.39
Mistral 2025.02		94.97
Qwen2 2025.02		94.71
GPT-4 2025.02		94.63
GPT-3.5 2025.02		94.05