| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| CRUXEval L2 | thought-tree-based classifier | Accuracy77 | 4 | 1mo ago | |
| SAFIM L3 | thought-tree-based classifier | Accuracy81 | 2 | 1mo ago | |
| SAFIM L1 | thought-tree-based classifier | Accuracy79 | 2 | 1mo ago | |
| CRUXEval (L3) | thought-tree-based classifier | Accuracy74 | 2 | 1mo ago | |
| CRUXEval L1 | thought-tree-based classifier | Accuracy89 | 2 | 1mo ago |