Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Knowledge-Intensive Reasoning on HLE

85Avg Score

R1-Searcher

0.03222.09144.1566.209Aug 8, 2025Sep 24, 2025Nov 10, 2025Dec 28, 2025Feb 13, 2026Apr 1, 2026May 19, 2026
Updated 14d ago

Evaluation Results

MethodLinks
2026.01
85-
2026.01
85-
2026.01
85-
2026.01
81-
2026.01
80-
2026.01
76-
2026.01
74-
2026.01
74-
2026.01
72-
37.98-
2026.05
28.64-
28.55-
2026.05
28.51-
2026.05
28.13-
2026.05
25.39-
23.21-
22.24-
21.59-
2026.05
21.49-
2026.05
21.31-
21.31-
21.12-
20.52-
2026.05
20.29-
2026.05
19.31-
2026.05
18.48-
2026.05
18.34-
2026.05
16.95-
2026.05
14.44-
14.39-
2026.05
14.35-
2026.05
13.46-
2026.05
13.46-
2026.05
13.37-
2026.05
12.67-
12.58-
2026.05
12.3-
2026.05
11.65-
2026.05
11.56-
2026.05
10.96-
2026.05
10.82-
2026.05
9.52-
9.29-
2026.05
8.73-
8.5-
2026.05
8.03-
2026.05
7.75-
2026.05
7.66-
7.15-
2026.01
7-
2026.05
6.92-
2026.05
6.73-
2026.05
6.55-
2026.01
6.4-
2026.05
6.04-
6.04-
2026.05
5.99-
2026.01
5.7-
2026.01
5.6-
5.57-
2026.05
5.34-
5.31
2026.01
5.2-
2026.05
5.06-
2026.01
4.6-
2026.01
4.5-
4.22-
2026.01
4.2-
2025.08
42
3.99-
2026.01
3.9-
2025.08
3.93
2026.05
3.81-
2025.08
3.54
2025.08
3.35