Logical Reasoning on CLUTRR gen_train23_test2to10

Accuracy: 24

Llama-3.1-8B-it (w/ DeepSeek-R1)

Updated 22d ago

Evaluation Results

Method	Date	Accuracy
Llama-3.1-8B-it (w/ DeepSeek-R1)	2025.06	24
Llama-3.1-8B-it (w/ SLR)	2025.06	19.1
Llama-3.1-8B-it (Base)	2025.06	10.2