Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Numerical Reasoning on TableBench (test)
Loading...
64.48
Accuracy
Qwen3-8B
1.352
17.741
34.13
50.519
May 18, 2026
Accuracy
Updated 13d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen3-8B
Strategy=TGN
2026.05
64.48
Qwen3-4B
Strategy=SCoT
2026.05
61.46
DeepSeek-R1-Distill-Llama-8B
Strategy=PIP
2026.05
50.13
Qwen3-1.7B
Strategy=PIP
2026.05
43.83
TableLLM-Qwen2-7B
Strategy=PIP
2026.05
38.54
Qwen2.5-Coder-7B-Instruct
Strategy=CoT
2026.05
37.03
TableGPT2-7B
Strategy=PIP
2026.05
31.23
Qwen2.5-7B-Instruct
Strategy=CoT
2026.05
25.69
Qwen2-7B-Instruct
Strategy=PIP
2026.05
24.69
Meta-Llama-3-8B-Instruct
Strategy=ReAct
2026.05
21.91
Llama-3.1-8B-Instruct
Strategy=ReAct
2026.05
13.85
Llama-3.2-3B-Instruct
Strategy=DP
2026.05
4.03
Llama-3.2-3B-Instruct
Strategy=TGN
2026.05
3.78
Feedback
Search any
task
Search any
task