| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| SNLI Hypothesis | iFlip-Conf | LFR83 | 37 | 1mo ago | |
| SNLI Premise | FIZLE | LFR0.759 | 37 | 1mo ago | |
| AG News | iFlip-NL | LFR0.915 | 37 | 1mo ago | |
| IMDb | iFlip-Conf | LFR100 | 37 | 1mo ago | |
| SST2 (test) | POLYJUICE | SLFR29 | 29 | 1mo ago | |
| AG News (test) | ZEROCF | SLFR98 | 29 | 1mo ago | |
| AVICI (test) | DoWhy | LIN RMSE (IN)0 | 16 | 12d ago | |
| AI-READI (Class 1) | Llama* | Validity98 | 9 | 1mo ago | |
| AI-READI Class 0 | GPT-4 | Validity0.99 | 9 | 1mo ago | |
| StableDiffusion 3 Evaluation Set | Gradient Ascent | LogitTgt24.875 | 6 | 1mo ago | |
| ImageNet-1k | Gradient Ascent | LogitTgt24.875 | 6 | 1mo ago | |
| FMCW radar dataset | SPARCE | Proximity450.6 | 6 | 1mo ago | |
| FMCW radar dataset diagonal gestures | GenFacts | Interpretability Score90.4 | 6 | 1mo ago | |
| AFHQ STYLEGAN2 | PCG | FID8.3 | 5 | 1mo ago | |
| PlantVillage | PCG | L1 Loss0.36 | 5 | 1mo ago | |
| FFHQ | PCG | L1 Distance0.42 | 5 | 1mo ago | |
| AFHQ | PCG | L1 Distance0.79 | 5 | 1mo ago | |
| MIMIC Chest X-ray 192x192 (test) | OT-FLOW | Composition MAE0.1835 | 4 | 24d ago | |
| Strong-3DIdent | Natural Counterfactuals | MAE (distance)0.058 | 4 | 1mo ago | |
| 3DIdent Weak | Natural Counterfactuals | MAE (d)0.024 | 2 | 1mo ago | |
| MorphoMNIST | D'ARTAGNAN | MSE (Ground Truth vs Reconstruction)2.303 | 2 | 1mo ago |