Max extraction on Max-extraction (test)
[Chart: Accuracy over time, with 95% confidence interval. Plotted point: Best probe, accuracy 40, May 5, 2026.]
Evaluation Results
| Method | Details | Date | Accuracy | 95% Confidence Interval |
|---|---|---|---|---|
| Best probe | Layer=33, R^2=0.757 | 2026.05 | 40 | 30 |
| DPS | alpha=5 | 2026.05 | 40 | 30 |
| Random direction | alpha=10, seeds=20 | 2026.05 | 12.4 | - |
| Baseline (vanilla Qwen3-8B) | Condition=Baseline (va... | 2026.05 | 11.1 | 5.6 |