| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| LVLM Evaluation Set (test) | KGW | Relative Drop | 0 | 36 | 3mo ago |
| DTS Paraphrase - ChatGPT | Lexical ICW | ROC-AUC | 0.924 | 11 | 1mo ago |
| DTS Deletion - 30% | Letter ICW | ROC-AUC | 0.998 | 9 | 1mo ago |
| DTS Replacement 30% | Acrostics ICW | ROC-AUC | 100 | 9 | 1mo ago |
| Unspecified robustness evaluation set (test) | | BER (−40–55°) | 1.08 | 6 | 2mo ago |
| Robustness (test) | | BER (40–49 cm) | 0.78 | 6 | 2mo ago |
| Dolly CW | BIRA | KGW | 100 | 5 | 6d ago |
| Drybean | GLW | Robustness Score (Row Deletion) | 116.96 | 5 | 22d ago |
| Default | TAB-DRW | Robustness: Row Deletion | 33.92 | 5 | 22d ago |
| Shoppers | TAB-DRW | Row Deletion Robustness | 38.43 | 5 | 22d ago |
| Magic | | Row Deletion Robustness | 18.92 | 5 | 22d ago |
| robustness evaluation set (test) | | BER (0.05 Ratio) | 0.31 | 2 | 2mo ago |
| CIFAR-10 (test) | - | - | - | 0 | 3mo ago |