Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Nonmonotonic reasoning on MultiLogicNMR

100Skeptical Accuracy

Gemini 2.5 Pro+ASP

36.87253.26169.6586.039Apr 30, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
10093.8
2026.04
98.398.3
2026.04
95.884
2026.04
88.561
2026.04
83.873.5
2026.04
74.355.3
2026.04
67.569.7
2026.04
57.751
2026.04
42.336.2
2026.04
39.346.8