| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Synthetic personalized interaction datasets (evaluation) | Task Completion Score8.48 | 10 | 4d ago | ||
| Real-world (test) | Score8.09 | 6 | 4d ago | ||
| L-IVA 1.0 (test) | ORCA | Task Success Rate - Kit73.8 | 4 | 4d ago | |
| Internal Task Benchmark | Avg Connection Time (hours)0 | 3 | 4d ago |