Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Visual Commonsense Reasoning (Q→A) on VCR (test)

61.49Q->A Accuracy (3%, 1000 SP/C)

VILLA (MAD*) [ROBERTa/CLIP-V]

32.224439.822247.4255.0178Apr 22, 2022
Updated 4d ago

Evaluation Results

MethodLinks
2022.04
61.49-43.1178.91
2022.04
60.93-42.9578.83
2022.04
60.8826.7842.2377.05
2022.04
58.9826.840.4377.61
2022.04
58.3840.73-78.59
2022.04
57.01-34.9378.27
2022.04
53.5434.63-68.36
2022.04
39.0636.78-55.91
2022.04
38.154.8238.2354.23
2022.04
36.34-34.4136.42
2022.04
35.0239.43-57.64
2022.04
34.8523.3730.8553.48
2022.04
33.3524.7831.4354.24