Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Commonsense Reasoning on CommonsenseQA (LLMcritic Metrics)

15.54LLMcritic Calls

VecCISC + HAC

11.6412.652513.66514.6775May 8, 2026
Updated 22d ago

Evaluation Results

MethodLinks
2026.05
15.54-22.31
2026.05
15.54-22.31
2026.05
15.23-23.83
2026.05
13.81-30.95
2026.05
12.7-36.5
2026.05
12.7-36.5
2026.05
12.45-37.76
2026.05
11.89-40.56
2026.05
11.79-41.02
2026.05
11.79-41.02