Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ZebraLogicBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Adding MistakeZebraLogicBench (ZLB)
AOC83.8
7
Truncated CoT AnsweringZebraLogicBench (ZLB)
AOC58.7
7
Showing 2 of 2 rows