
Logical Reasoning on PrOntoQA

91.4 Calibrated Accuracy

Llama 3.1 8B

[Chart: score over time; y-axis ticks 47.824 to 81.763; x-axis May 29, 2024 to May 12, 2026]
Updated 21d ago

Evaluation Results

Date      Score   Method   Links
2026.05   91.4
2026.05   80.7
2026.05   77.4
2026.05   77.1
2026.05   67.8
2024.05   63.8
2026.05   63.5
2024.05   60.4
2024.05   59.3
2024.05   56.6
2024.05   56.6
2024.05   55.7
2024.05   54.5
2026.05   52.9
2024.05   50.8
2026.05   49.6
2026.05   49.5