Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Evaluation on SuperGPQA Continual
Loading...
15.85
Accuracy
STOC
9.35
11.0375
12.725
14.4125
May 11, 2026
Accuracy
Updated 22d ago
Evaluation Results
Method
Method
Links
Accuracy
STOC
Model Scale=1.7B, Prom...
2026.05
15.85
STOC
Model Scale=0.6B, Prom...
2026.05
15.24
LAMOL
Model Scale=0.6B, Prom...
2026.05
13.87
LAMOL
Model Scale=1.7B, Prom...
2026.05
13.35
Naive
Model Scale=0.6B, Prom...
2026.05
10.98
Naive
Model Scale=1.7B, Prom...
2026.05
9.6
Feedback
Search any
task
Search any
task