Agentic Tool Use on tau2-Bench
[Chart: Accuracy over time. Data point shown: PivotRL, accuracy 64, Mar 22, 2026.]
Evaluation Results
Method            Details                    Date     Accuracy
PivotRL           Base Model=Nemotron-3-...  2026.03  64
Nemotron-3-Super  Post-training Stage=SFT    2026.03  48