
synthetic decision-making rounds

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Decision-making | 30 synthetic decision-making rounds (evaluation) | Mean Rank: 2.55 |
14