
Deliberative decision-making tasks

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Decision-making | Deliberative decision-making tasks n=45 (overall) | Mean Tokens: 237,565 | 5