| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| LS1 -> LS2 (test) | PCogAlign | RSA4.032 | 13 | 1mo ago | |
| Assistant and Summary personalization tasks (test) | vol-mo | Win Rate83.91 | 12 | 1mo ago | |
| Real-world failure cases from large-scale commercial PA | RP-Reasoner | Macro Accuracy73.4 | 4 | 1mo ago | |
| RPEVAL | RP-Reasoner | Macro Accuracy24 | 4 | 1mo ago |