
Logical Reasoning on CLUTRR gen_train23_test2to10

Accuracy: 24

Llama-3.1-8B-it (w/ DeepSeek-R1)

Jun 18, 2025
Updated 22d ago

Evaluation Results

Date      Accuracy   Method   Links
2025.06   24
2025.06   19.1
2025.06   10.2