Skill retrieval on LogicBench (Recall@1, Recall@10)
[Chart: Recall@1 and Recall@10 by method. Highlighted point: Qwen3-32B, Recall@1 31.4, Apr 27, 2026.]
Updated 1mo ago
Evaluation Results
Method          Method details               Links     Recall@1  Recall@10
Qwen3-32B       Retrieval Stage=LLM-ba...    2026.04   31.4      55.3
Qwen3-235B      Retrieval Stage=LLM-ba...    2026.04   30.9      56.4
Llama-3.3-70B   Retrieval Stage=LLM-ba...    2026.04   27.4      55.4
Mistral3.1-24B  Retrieval Stage=LLM-ba...    2026.04   26.7      53.2
Qwen3-4B        Retrieval Stage=LLM-ba...    2026.04   21.8      43.3
Llama-3.1-8B    Retrieval Stage=LLM-ba...    2026.04   15.9      42.0
BM25            Retrieval Stage=First-...    2026.04   12.0      36.1
Hybrid          Retrieval Stage=First-...    2026.04   12.0      33.6
Contriever      Retrieval Stage=First-...    2026.04   5.5       18.4
BGE             Retrieval Stage=First-...    2026.04   4.1       20.5
TF-IDF          Retrieval Stage=First-...    2026.04   1.8       18.2
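
The page reports Recall@1 and Recall@10 but does not spell out how they are computed. As a rough sketch only, assuming each query has exactly one gold skill and that scores are percentages of queries whose gold skill appears in the top k results, the metric could be computed as below. The function name `recall_at_k`, the skill identifiers, and the toy data are hypothetical and are not taken from the benchmark.

```python
from typing import Sequence


def recall_at_k(ranked: Sequence[Sequence[str]], gold: Sequence[str], k: int) -> float:
    """Percentage of queries whose gold item appears in the top-k ranked results.

    Assumption: one gold item per query (the benchmark's protocol may differ).
    """
    if len(ranked) != len(gold):
        raise ValueError("ranked and gold must have the same length")
    if not gold:
        raise ValueError("need at least one query")
    hits = sum(1 for results, target in zip(ranked, gold) if target in results[:k])
    return 100.0 * hits / len(gold)


# Hypothetical toy data: three queries, each with a ranked list of skill IDs.
ranked = [
    ["modus_ponens", "modus_tollens", "disjunctive_syllogism"],
    ["modus_tollens", "hypothetical_syllogism", "modus_ponens"],
    ["constructive_dilemma", "modus_ponens", "modus_tollens"],
]
gold = ["modus_ponens", "modus_ponens", "disjunctive_syllogism"]

r1 = recall_at_k(ranked, gold, k=1)  # only the first query hits at rank 1
r3 = recall_at_k(ranked, gold, k=3)  # first two queries hit within the top 3
print(f"Recall@1 = {r1:.1f}, Recall@3 = {r3:.1f}")
```

Under this definition Recall@10 can never be lower than Recall@1 for the same method, which matches every row in the table above.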