Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Expert-Level Human Knowledge Reasoning on Humanity's Last Exam

38.3Pass@1

Tongyi DeepResearch

16.87622.4382833.562Oct 28, 2025
Updated 15d ago

Evaluation Results

MethodLinks
2025.10
38.3
2025.10
32.9
2025.10
29.8
2025.10
26.9
2025.10
26.9
2025.10
26.6
2025.10
24.9
2025.10
21.2
2025.10
20.3
2025.10
18.1
2025.10
17.7