Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Downstream Task Evaluation on Downstream Tasks Aggregate
Loading...
54.37
Accuracy
SPIRALFORMER-L
39.3004
43.2127
47.125
51.0373
Feb 12, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
SPIRALFORMER-L
Model Scale=1.4B, Topo...
2026.02
54.37
SPIRALFORMER-L
Model Scale=1.4B, Topo...
2026.02
51.75
BASELINE (PYTHIA)
Model Scale=160M, Eval...
2026.02
39.88
Feedback
Search any
task
Search any
task