Program of Thought Reasoning on TableBench
[Chart: Rge Score. Top result: GPT-4o, 51.96, Dec 23, 2025]
Evaluation Results
| Method | Model Scale Group | Date | Rge Score |
|---|---|---|---|
| GPT-4o | Larger | 2025.12 | 51.96 |
| Qwen-Plus | Larger | 2025.12 | 41.79 |
| QwQ-32B | Larger | 2025.12 | 40.03 |
| TableGPT2-7B | Comp... | 2025.12 | 39.8 |
| Qwen3-32B | Larger | 2025.12 | 37.78 |
| Qwen3-14B | Larger | 2025.12 | 36.61 |
| TableGPT-R1-8B | Comp... | 2025.12 | 35.12 |
| DeepSeek-V3 | Larger | 2025.12 | 33.05 |
| Qwen3-8B | Comp... | 2025.12 | 28.01 |
| Qwen3-72B | Larger | 2025.12 | 27.72 |
| Table-R1-Zero-7B | Comp... | 2025.12 | 7.54 |
| Llama-3.1-8B | Comp... | 2025.12 | 6.73 |
| TableLLM | Comp... | 2025.12 | 0 |