Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Search Ranking on Unseen search data (test)
Loading...
11.8
M vs V2 Score
V1
-0.472
2.714
5.9
9.086
Mar 23, 2026
M vs V2 Score
Updated 24d ago
Evaluation Results
Method
Method
Links
M vs V2 Score
V1
Key Innovations=Mean-P...
2026.03
11.8
V3.5
Key Innovations=V3.4 +...
2026.03
8.3
V3.4
Key Innovations=V3.3 +...
2026.03
6.3
V3.3
Key Innovations=V3.2 +...
2026.03
6.1
V3.1
Key Innovations=Transf...
2026.03
5
V3.0
Key Innovations=V2 + P...
2026.03
4.7
V3.2
Key Innovations=V3.1 +...
2026.03
4.1
V2
Key Innovations=Transf...
2026.03
0
Feedback
Search any
task
Search any
task