| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| Toy dataset 0% label noise (test) | SimPO | Accuracy | 99.6 | 76 | 4d ago |
| Toy dataset 50% label noise (test) | SSPO | Accuracy | 75.7 | 24 | 4d ago |
| Toy dataset Noise 30% (test) | SSPO | Accuracy | 0.739 | 12 | 4d ago |
| Toy dataset Noise 10% (test) | SSPO | Accuracy | 93.1 | 12 | 4d ago |
| Tennis (test) | C-GPM | Test AUC | 0.58 | 4 | 4d ago |
| Pokémon (test) | GPM | Test AUC | 86 | 4 | 4d ago |
| Chameleon (test) | GPM | Test AUC | 92 | 4 | 4d ago |
| Synthetic (test) | GPM | Test AUC | 98 | 4 | 4d ago |
| Anthropic HH-RLHF+VI Preference (test) | MC-STL | Overall Accuracy | 64 | 3 | 4d ago |
| ML-100K (test) | | AUC | 69.5 | 2 | 4d ago |
| UCI (test) | | AUC | 56.5 | 2 | 4d ago |
| 3 Grades (test) | | AUC | 53.22 | 2 | 4d ago |
| LSAT (test) | Spectral algorithm | AUC | 0.707 | 2 | 4d ago |
| Website (test) | C-GPM | Test AUC | 0.66 | 2 | 4d ago |