Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reasoning on ObjC (test)
Loading...
90
Accuracy
TCR-gold
26.56
43.03
59.5
75.97
Jan 29, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
TCR-gold
Backbone=Qwen3-8B-Inst...
2026.01
90
TCR
Backbone=Qwen3-8B-Inst...
2026.01
88.1
Qwen3-8B-Instruct
Backbone=Qwen3-8B-Inst...
2026.01
85
DoLa
Backbone=Qwen3-8B-Inst...
2026.01
82.7
TCR-gold
Backbone=LLaMA3-8B-Ins...
2026.01
76.4
TCR-gold
Backbone=Qwen2.5-7B-In...
2026.01
76
DoLa
Backbone=LLaMA3-8B-Ins...
2026.01
71.2
LLaMA3-8B-Instruct
Backbone=LLaMA3-8B-Ins...
2026.01
68.8
TCR
Backbone=LLaMA3-8B-Ins...
2026.01
67.8
TCR
Backbone=Qwen2.5-7B-In...
2026.01
56
DoLa
Backbone=Qwen2.5-7B-In...
2026.01
52.3
Qwen2.5-7B-Instruct
Backbone=Qwen2.5-7B-In...
2026.01
52
TCR
Backbone=Phi-3-Instruc...
2026.01
45
TCR-gold
Backbone=Phi-3-Instruc...
2026.01
45
Phi-3-Instruct
Backbone=Phi-3-Instruc...
2026.01
30.9
DoLa
Backbone=Phi-3-Instruc...
2026.01
29
Feedback
Search any
task
Search any
task