Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Science Reasoning on GPQA 5-shot
Loading...
33.3
Accuracy
Baseline
28.828
29.989
31.15
32.311
May 9, 2026
Accuracy
Tokens Per Second
Steps
Speed Multiplier
Updated 21d ago
Evaluation Results
Method
Method
Links
Accuracy
Tokens Per Second
Steps
Speed Multiplier
Baseline
Base Model=Dream-7B-In...
2026.05
33.3
0.5
256
1
KLASS
Base Model=Dream-7B-In...
2026.05
32.6
11.6
8
24.8
LEAP
Base Model=Dream-7B-In...
2026.05
32.4
22
4
62
LEAP
Base Model=LLaDA-8B-In...
2026.05
32.1
12.1
17
13.3
Conf-Based
Base Model=Dream-7B-In...
2026.05
32.1
22.3
5
49.6
Conf-Based
Base Model=LLaDA-8B-In...
2026.05
31.9
8.3
28
9.1
Baseline
Base Model=LLaDA-8B-In...
2026.05
30.6
4
256
1
LoPA
Base Model=Dream-7B-In...
2026.05
30.6
7.6
6
20.7
KLASS
Base Model=LLaDA-8B-In...
2026.05
29
10.1
124
2.1
Feedback
Search any
task
Search any
task