Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Helpful assistant task on Tulu-2 13B
Loading...
1.2562
HV Score
UniARM
0.9624
1.038675
1.11495
1.191225
Feb 10, 2026
HV Score
MIP Score
Updated 4d ago
Evaluation Results
Method
Method
Links
HV Score
MIP Score
UniARM
Inference Time (s)=8.6...
2026.02
1.2562
1.34
PARM
Inference Time (s)=8.6...
2026.02
1.1916
1.21
GenARM
Inference Time (s)=18....
2026.02
0.9737
1.07
Feedback
Search any
task
Search any
task