Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Broad information seeking on WideSearch English
Loading...
62
Item F1
Claude-4-Sonnet
43.384
48.217
53.05
57.883
Mar 16, 2026
Item F1
Updated 1mo ago
Evaluation Results
Method
Method
Links
Item F1
Claude-4-Sonnet
# Samples=?, # OS Samp...
2026.03
62
OpenAI-o3
# Samples=?, # OS Samp...
2026.03
60
Kimi-K2-Instruct-1T
# Samples=?, # OS Samp...
2026.03
59.9
OpenSeeker-v1-30B-SFT
# Samples=11.7 k, # OS...
2026.03
59.4
WebLeaper-30B
# Samples=15 k, # OS S...
2026.03
44.1
Feedback
Search any
task
Search any
task