Character Count Evaluation on Qwen3-8B 200 Prompts (test)
Top result: 9-row repair, Accuracy 98
[Chart: Accuracy (y-axis) by date; date label: May 5, 2026]
Updated 28d ago
Evaluation Results
Method            Details                     Date     Accuracy
9-row repair      Intervention type=9-ro...   2026.05  98
Probe-round       Intervention type=Prob...   2026.05  96.8
Fullvocab repair  Intervention type=Full...   2026.05  57.7
Baseline          Intervention type=None...   2026.05  49.3