Entity counting

Benchmarks

Task Name       | Dataset Name                                              | SOTA Result                     | Trend
Entity counting | Entity counting Qwen3-8B prompts N=200x3 seeds (test)     | Greedy Generation Accuracy: 97  | 7
Entity counting | Entity counting 200 prompts Qwen3-8B (test)               | Accuracy: 98.7                  | 5