| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| TF10 | SPADE | Normalized Median Score0.748 | 25 | 21d ago | |
| TF8 | SPADE | Normalized Median Score67.9 | 25 | 21d ago | |
| LLM-DM | Normalized Median Score100 | 25 | 21d ago | ||
| D’Kitty | PGS | Normalized Median Score0.941 | 25 | 21d ago | |
| Ant | SPADE | Normalized Median Score0.935 | 25 | 21d ago | |
| SuperC | Normalized Median Score46.3 | 25 | 21d ago | ||
| Overall Task Suite SuperC, Ant, D’Kitty, LLM-DM, TF8, TF10 | SPADE | Mean Rank1.7 | 24 | 21d ago | |
| Design-bench 100-th percentile | N2CE | TFBIND8 Score98.3 | 20 | 1mo ago | |
| Design-Bench | N2CE | TFBIND8 Score0.99 | 11 | 1mo ago | |
| 2D Branin function top-10%-tile-removed (held-out) | Noisier NCE | Achieved Function Value0.4 | 4 | 1mo ago |