Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Data Analysis on RealHitBench

79.55GPT Score

DeepSeek-R1

20.176435.590751.00566.4193Jun 16, 2025Jul 17, 2025Aug 18, 2025Sep 19, 2025Oct 20, 2025Nov 21, 2025Dec 23, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.06
79.5542.59
2025.06
79.4537.08
2025.06
77.2637.63
2025.06
75.2136.01
2025.06
74.836.26
2025.06
73.3736.36
2025.06
72.0535.25
2025.06
70.8334.44
2025.06
70.7236.17
2025.06
68.6620.86
2025.06
68.4535.9
2025.06
67.2236.26
2025.12
66.67-
2025.12
66.53-
2025.12
66.29-
2025.06
65.2433.1
2025.12
64.99-
2025.12
63.03-
2025.06
62.7633.25
2025.12
62.04-
2025.12
60.12-
2025.06
60.1232.25
2025.06
57.7419.84
57.2831.87
2025.12
55.54-
2025.06
54.5526.98
2025.06
54.3925.19
2025.12
53.28-
2025.12
53.27-
2025.12
53.16-
2025.06
53.0632.36
52.7530.59
2025.06
52.2627.98
2025.12
47.86-
2025.06
47.8627.26
2025.06
47.318.86
41.3222.75
41.1825.46
2025.06
40.1724.17
2025.06
39.9822.43
2025.06
39.6925.39
2025.06
38.0216.24
2025.06
37.4124.3
2025.12
36.24-
2025.06
30.7118.53
2025.06
26.617.74
2025.06
25.4612.13
2025.06
24.737.89
2025.06
22.468.72