Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Logical Reasoning on CLUTRR gen_train234_test2to10

25Accuracy

Llama-3.1-8B-it (w/ DeepSeek-R1)

8.8813.06517.2521.435Jun 18, 2025
Updated 22d ago

Evaluation Results

MethodLinks
2025.06
25
2025.06
16.4
2025.06
9.5