Share your thoughts, 1 month free Claude Pro on usSee more

Commonsense Reasoning on Com^2-hard Intervention (test)

54.77Accuracy

Generator Baseline

Updated 4mo ago

Evaluation Results

Method	Links
Generator Baseline 2026.02		54.77
MENTORCOLLAB FREE 2026.02		54.77
MENTORCOLLAB MLP 2026.02		13.69
CoSD 2026.02		8.3
R-Stitch 2026.02		4.98