Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Sequence Modeling on Dyck-n
Loading...
64.4
Acc
Llama-3.1-8B
-2.576
14.812
32.2
49.588
Feb 4, 2026
Acc
NLDD
Updated 4d ago
Evaluation Results
Method
Method
Links
Acc
NLDD
Llama-3.1-8B
Regime=Faithful Regime...
2026.02
64.4
3
DeepSeek-Coder-6.7B
Regime=Faithful Regime...
2026.02
47.2
9.5
Gemma-2-9B
Regime=Anti-Faithful R...
2026.02
0
12.5
Feedback
Search any
task
Search any
task