Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Skill Retrieval on BigCodeBench
Loading...
73.2
nDCG@1
Qwen3-32B
48.24
54.72
61.2
67.68
Apr 27, 2026
nDCG@1
nDCG@10
Updated 1mo ago
Evaluation Results
Method
Method
Links
nDCG@1
nDCG@10
Qwen3-32B
Retrieval Stage=Rerank...
2026.04
73.2
74.1
Mistral3.1-24B
Retrieval Stage=Rerank...
2026.04
72
72.9
Llama-3.3-70B
Retrieval Stage=Rerank...
2026.04
71.1
72.1
Qwen3-235B
Retrieval Stage=Rerank...
2026.04
70.7
73.1
Qwen3-4B
Retrieval Stage=Rerank...
2026.04
68.4
66.6
BM25
Retrieval Stage=First-...
2026.04
61.7
55.4
Hybrid
Retrieval Stage=First-...
2026.04
61.7
59.9
Llama-3.1-8B
Retrieval Stage=Rerank...
2026.04
59.2
61.6
TF-IDF
Retrieval Stage=First-...
2026.04
55.2
52
BGE
Retrieval Stage=First-...
2026.04
54
53.6
Contriever
Retrieval Stage=First-...
2026.04
49.2
47.9
Feedback
Search any
task
Search any
task