| Dataset Name | SOTA Method | Metric | Value | Trend | |
|---|---|---|---|---|---|
| BPO Eval (test) | BPO + Vicuna-v1.3 13B | Win Rate (A) | 59.5 | 7 | 4d ago |
| Dolly Eval | BPO + Llama-2-chat 13B (Cross-size) | A Win Rate | 54 | 7 | 4d ago |
| Self-instruct Eval | BPO + Llama-2-chat 7B | Win Rate (A) | 53.6 | 7 | 4d ago |
| Vicuna Eval v1.3 (test) | BPO + Vicuna-v1.3 7B | A Win Rate | 65 | 7 | 4d ago |