| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MT-Bench | TRACT | Pearson's r0.672 | 16 | 4d ago | |
| Vicuna Bench | TRACT | Pearson Correlation (r)0.605 | 16 | 4d ago | |
| FLASK | TRACT | Pearson's r0.518 | 16 | 4d ago | |
| FB Bench (Feedback Bench) | Pearson's r0.932 | 16 | 4d ago | ||
| 100 Romanian synthetic prompts (test) | Fluency4.71 | 7 | 4d ago |