Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Alignment on Arena-Hard (hard prompt gemini)
Loading...
70.4
Hard Prompt Gemini Score
SnapMLA
54.904
58.927
62.95
66.973
Feb 11, 2026
Hard Prompt Gemini Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Hard Prompt Gemini Score
SnapMLA
Backbone=LongCat-Flash...
2026.02
70.4
FlashMLA
Backbone=LongCat-Flash...
2026.02
69.9
FlashMLA
Backbone=DeepSeek-V3.1...
2026.02
57.1
SnapMLA
Backbone=DeepSeek-V3.1...
2026.02
55.5
Feedback
Search any
task
Search any
task