Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Table Reasoning Accuracy on TableBench

63.48Accuracy

SERO

27.901637.138346.37555.6117May 27, 2026May 28, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2026.05
63.48-
2026.05
61.6-
2026.05
60.68-
2026.05
60.64-
2026.05
59-
2026.05
58.5-
2026.05
57.91-
2026.05
54.8542.4
2026.05
54.06-
2026.05
53.6540.5
2026.05
52.43-
2026.05
52.3-
2026.05
51.7728.1
2026.05
51.76-
2026.05
50.5230.1
2026.05
50.4127.4
2026.05
50.3329.4
2026.05
50.0229.9
2026.05
49.8729.7
2026.05
49.5428.1
2026.05
49.37-
2026.05
49.2229.1
2026.05
49.2130.4
2026.05
49.08-
2026.05
48.8928.4
2026.05
48.8128.6
2026.05
48.7428.3
2026.05
48.32-
2026.05
47.99-
2026.05
47.95-
2026.05
47.24-
2026.05
46.1236.7
2026.05
44.43-
2026.05
44.05-
2026.05
44.0336.4
2026.05
42.556.12
2026.05
42.556.12
2026.05
29.44-
2026.05
29.27-