Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Trace-based

Benchmarks

Task NameDataset NameSOTA ResultTrend
Tool ExecutionTrace-based setting
Improvement (%)14.8
4
Tool SelectionTrace-based setting
Improvement6.8
4
Showing 2 of 2 rows