Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reasoning on LLC (test)
Loading...
90.7
Accuracy
TCR-gold
4.692
27.021
49.35
71.679
Jan 29, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
TCR-gold
Backbone=LLaMA3-8B-Ins...
2026.01
90.7
TCR
Backbone=LLaMA3-8B-Ins...
2026.01
82.3
LLaMA3-8B-Instruct
Backbone=LLaMA3-8B-Ins...
2026.01
81
DoLa
Backbone=LLaMA3-8B-Ins...
2026.01
71.2
TCR-gold
Backbone=Qwen3-8B-Inst...
2026.01
71
TCR
Backbone=Qwen3-8B-Inst...
2026.01
68
Qwen3-8B-Instruct
Backbone=Qwen3-8B-Inst...
2026.01
67.5
TCR-gold
Backbone=Phi-3-Instruc...
2026.01
59.6
DoLa
Backbone=Qwen3-8B-Inst...
2026.01
57
TCR
Backbone=Phi-3-Instruc...
2026.01
56.1
Phi-3-Instruct
Backbone=Phi-3-Instruc...
2026.01
53.1
DoLa
Backbone=Phi-3-Instruc...
2026.01
29.5
TCR-gold
Backbone=Qwen2.5-7B-In...
2026.01
23
TCR
Backbone=Qwen2.5-7B-In...
2026.01
16.2
Qwen2.5-7B-Instruct
Backbone=Qwen2.5-7B-In...
2026.01
11.7
DoLa
Backbone=Qwen2.5-7B-In...
2026.01
8
Feedback
Search any
task
Search any
task