Seed1.8 Model Card: Towards Generalized Real-World Agency
About
We present Seed1.8, a foundation model aimed at generalized real-world agency: going beyond single-turn prediction to multi-turn interaction, tool use, and multi-step execution. Seed1.8 keeps strong LLM and vision-language performance while supporting a unified agentic interface-search, code generation and execution, and GUI interaction. For deployment, it offers latency- and cost-aware inference, including configurable thinking modes and optimized visual encoding for images and video. We report evaluations on standard benchmarks and application-aligned workflows spanning foundational skills, multimodal understanding, and agentic behavior. Seed1.8 is released to support further research and development on interactive, real-world use cases.
Bytedance Seed• 2026
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Understanding | VideoMME | -- | 357 | |
| Streaming Video Understanding | StreamingBench | -- | 259 | |
| Temporal Video Understanding | TempCompass | -- | 141 | |
| Video Understanding | LongVideoBench | -- | 123 | |
| Video Understanding | MMVU | -- | 76 | |
| Video Understanding | LVBench | Average Score73 | 75 | |
| Video Understanding | VideoMMMU | -- | 59 | |
| Computer Use | OSWorld | -- | 45 | |
| Multimodal Reasoning | MathVista | Pass@187.7 | 36 | |
| GUI Navigation | OSWorld (Verified) | OS Success Rate66.67 | 32 |
Showing 10 of 74 rows
...