Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Efficiency Evaluation on GPT-2 124M
Loading...
1
Inference Speed (x)
FP32 Baseline
0.95
0.975
1
1.025
Jan 5, 2026
Inference Speed (x)
Model Size (MB)
Updated 4d ago
Evaluation Results
Method
Method
Links
Inference Speed (x)
Model Size (MB)
FP32 Baseline
Precision=float32, Qua...
2026.01
1
-
Feedback
Search any
task
Search any
task