Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TableBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Table Question AnsweringTableBench
EM62
40
Table ReasoningTableBench
Pass@172.52
21
Table Chain of Thought ReasoningTableBench
Rge54.28
13
Symbolic Chain of Thought ReasoningTableBench
Rge1.99
13
Program of Thought ReasoningTableBench
Rge Score51.96
13
Data ProcessingTableBench
Rge52.18
13
Complex Tabular ReasoningTableBench
TB-NR0.788
11
Table-QATableBench
Quality Score33.6
4
Showing 8 of 8 rows