Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Count Frequency

Benchmarks

Task NameDataset NameSOTA ResultTrend
Cooperative Multi-agent Problem SolvingCount Frequency (Hard)
Detected Error1.67
9
Cooperative Multi-agent Problem SolvingCount Frequency (Medium)
Detected Error1.33
9
Cooperative Multi-agent Problem SolvingCount Frequency (Easy)
Detected Error1.33
9
Showing 3 of 3 rows