
Zero-shot Commonsense Reasoning on Zero-shot tasks (test)

Average Accuracy: 61

Original

Updated 4mo ago

Evaluation Results

Method	Links	Average Accuracy
Original 2026.02		61
SVD-LLM v2 2026.02		60
COMPOT 2026.02		60
Original 2026.02		59
COMPOT 2026.02		57
SVD-LLM 2026.02		57
SVD-LLM v2 2026.02		56
SVD-LLM 2026.02		55
ASVD 2026.02		54
ASVD 2026.02		44
FWSVD 2026.02		43
FWSVD 2026.02		42