Share your thoughts, 1 month free Claude Pro on usSee more

Logical Reasoning on CLUTRR gen_train234_test2to10

25Accuracy

Llama-3.1-8B-it (w/ DeepSeek-R1)

Updated 22d ago

Evaluation Results

Method	Links
Llama-3.1-8B-it (w/ DeepSeek-R1) 2025.06		25
Llama-3.1-8B-it (w/ SLR) 2025.06		16.4
Llama-3.1-8B-it (Base) 2025.06		9.5