Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Symbolic Reasoning on Symbolic Longer

0.187Accuracy (Clean, Avg)

SCO

0.06740.098450.12950.16055Oct 31, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.10
0.1870.1210.1050.1130.1130.1520.1590.0980.136
2024.10
0.1230.120.120.130.1230.1230.10.110.111
2024.10
0.0940.0980.0790.0790.0850.0850.0740.0650.075
2024.10
0.0920.0630.0720.060.0650.070.0680.060.066
2024.10
0.0720.0340.0350.0250.0310.0380.0360.0360.037