Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Interpretation Alignment on Ambrosia sampled 30 ambiguous examples
Loading...
0.917
Alignment Score
IntentRL
0.87115
0.894075
0.917
0.939925
Nov 13, 2025
Alignment Score
Agreement Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Alignment Score
Agreement Score
IntentRL
Model=4B Instruct
2025.11
0.917
0.84
Feedback
Search any
task
Search any
task