Logical Reasoning on CLUTRR rob train clean 23 all (test)

35.6 Accuracy

Llama-3.1-8B-it (w/ SLR)

[Chart: Accuracy, Jun 18, 2025]
Updated 22d ago

Evaluation Results

Method  Links
2025.06  35.6
2025.06  29.1
2025.06  26