SOTA Search Relevance benchmarks and papers with code | Wizwand
Search Relevance
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
| --- | --- | --- | --- | --- | --- |
| Minor Language family Qwen2.5 series benchmark (test) | SERM | NDCG@1 | 84.99 | 18 | 3mo ago |
| Romance language family Qwen2.5 series benchmark (test) | SERM | NDCG@1 | 88.14 | 18 | 3mo ago |
| Germanic language family Qwen2.5 series benchmark (test) | SERM | NDCG@1 | 87.56 | 18 | 3mo ago |
| ESCI | DeBERTa-v3-large | Macro F1 | 61.03 | 14 | 2mo ago |
| WANDs | DeBERTa-v3-large | Macro F1 | 91.39 | 12 | 2mo ago |
| JD.com Search Traffic Online Evaluation (A/B test) | K-CARE | Bad Case Rate | 11.39 | 3 | 1mo ago |
| Taobao Visual Search (Offline Evaluation Set) | REVISION | Top-1 Relevance | 66.56 | 2 | 3mo ago |
| Manual Annotation Queries Knowledge 2,000 queries | TaoSR1 | GSB | 18.45 | 1 | 2mo ago |
| Manual Annotation Queries Negative 2,000 queries | TaoSR1 | GSB Score | 10.92 | 1 | 2mo ago |
| Manual Annotation Queries Alternative 2,000 queries | TaoSR1 | GSB Score | 34.43 | 1 | 2mo ago |
| Manual Annotation Queries Q&A 2,000 queries | TaoSR1 | GSB | 16.62 | 1 | 2mo ago |
| Online Search Platform Longtail Traffic Current | SERM (Distilled Qwen2.5-7B) | Change Query Ratio | -0.1312 | 1 | 3mo ago |
| Online Search Platform Overall Current (Live Traffic) | SERM (Distilled Qwen2.5-7B) | User Negative Feedback | -1.2081 | 1 | 3mo ago |
Showing 13 of 13 rows