Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

F1

Benchmarks

Task NameDataset NameSOTA ResultTrend
Tool CallingF1 Average
Tool Call Name F191.37
16
Multi-objective OptimizationF1
Hypervolume1.011
14
Global OptimizationF1
Final Error0
14
Black-box Optimizationf1
NPR Mean1.038
9
Showing 4 of 4 rows