Entity counting on Entity counting Qwen3-8B prompts N=200x3 seeds (test)

Greedy Generation Accuracy: 97

LoRA Q/V rank-16 (entity-counting only)

Updated 2mo ago

Evaluation Results

Method	Date	Greedy Generation Accuracy		
LoRA Q/V rank-16 (entity-counting only)	2026.05	97	-	-
Probe-round (oracle UB)	2026.05	96	98.7	-
LoRA Q/V rank-16	2026.05	83.1	96	91.7
Hard DPS	2026.05	72.4	98.7	-
LoRA Q/V rank-16 (per-seed, multi-task)	2026.05	71.5	-	-
Baseline (entity counting)	2026.05	7.2	13.7	0
9-row lm_head repair	2026.05	0	60.7	60.3
Soft DPS	2026.05	-	13.2	-
Norm rescaling (digit ×3)	2026.05	-	-	26.5