| Dataset Name | SOTA Method | Metric | Score | Trend | Updated |
|---|---|---|---|---|---|
| ROME Shortcut Decision-Making 1.0 | ARISE (Std) | F1 Score | 94.53 | 24 | 28d ago |
| ROME Contextual Ambiguity 1.0 | ARISE (Std) | F1 Score | 91.92 | 24 | 28d ago |
| ROME Implicit Risks 1.0 (IR) | ARISE (Std) | F1 Score | 86.81 | 24 | 28d ago |
| ROME Original 1.0 (Seed) | Abl. Unsafe | F1 Score | 94.53 | 24 | 28d ago |
| ROME IR unsafe | | F1 Score | 95.74 | 4 | 28d ago |
| ROME CA unsafe subset | | F1 Score | 97.35 | 4 | 28d ago |
| ROME SDM unsafe | | F1 Score | 100 | 4 | 28d ago |