| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Braceval | Accuracy70.8 | 15 | 2mo ago | ||
| Grocery Domain | GOOD | Human Score (%)75.78 | 3 | 26d ago | |
| Robot Domain | GOOD | Human Score88.57 | 3 | 26d ago | |
| Online Chinese conversation data 500 multi-turn proprietary dialogues (test) | C-SFT-Empathy | Win Rate vs GPT479.4 | 2 | 1mo ago |