Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Skill retrieval on CHAMP (Recall@K)
Loading...
22.5
Recall@1
Llama-3.3-70B
2.948
8.024
13.1
18.176
Apr 27, 2026
Recall@1
Recall@10
Updated 1mo ago
Evaluation Results
Method
Method
Links
Recall@1
Recall@10
Llama-3.3-70B
Retrieval Stage=LLM-ba...
2026.04
22.5
47.8
Qwen3-32B
Retrieval Stage=LLM-ba...
2026.04
22.3
49.1
Qwen3-235B
Retrieval Stage=LLM-ba...
2026.04
22.1
50.2
Qwen3-4B
Retrieval Stage=LLM-ba...
2026.04
18.5
44
Mistral3.1-24B
Retrieval Stage=LLM-ba...
2026.04
18
49.3
Llama-3.1-8B
Retrieval Stage=LLM-ba...
2026.04
15.8
41.6
BM25
Retrieval Stage=First-...
2026.04
13.2
36.1
Hybrid
Retrieval Stage=First-...
2026.04
13.2
41.4
BGE
Retrieval Stage=First-...
2026.04
9.8
34
TF-IDF
Retrieval Stage=First-...
2026.04
7.2
25.7
Contriever
Retrieval Stage=First-...
2026.04
3.7
29.3
Feedback
Search any
task
Search any
task