Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Common Sense Reasoning on SWAG

92.29Accuracy

LD-MoLE

62.348470.121777.89585.6683May 16, 2025Jun 7, 2025Jun 30, 2025Jul 23, 2025Aug 15, 2025Sep 7, 2025Sep 30, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.09
92.29
2025.09
92.22
2025.09
91.37
2025.09
90.45
2025.09
89.15
2025.09
86.97
2025.09
86.72
2025.09
86.37
2025.09
84.17
2025.09
84.11
2025.09
83.96
2025.09
83.56
2025.05
73.3
2025.05
72.7
2025.05
71.9
2025.05
71.64
2025.05
71
2025.05
70
2025.05
68.99
2025.05
68.95
2025.05
66.1
2025.05
65.75
2025.05
65.58
2025.05
63.5