| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| SALAD Alpaca HarmBench | Residual Paving (oracle) | Target Success | 99.8 | 18 | 13d ago |
| Phi/Qwen judged grid | GoR/DIM ablation | Target Success Rate | 33.3 | 7 | 13d ago |
| N512 objective checks | Residual Paving | Target Success Rate | 94.8 | 4 | 13d ago |