Seed1.8 Model Card: Towards Generalized Real-World Agency
About
We present Seed1.8, a foundation model aimed at generalized real-world agency: going beyond single-turn prediction to multi-turn interaction, tool use, and multi-step execution. Seed1.8 keeps strong LLM and vision-language performance while supporting a unified agentic interface-search, code generation and execution, and GUI interaction. For deployment, it offers latency- and cost-aware inference, including configurable thinking modes and optimized visual encoding for images and video. We report evaluations on standard benchmarks and application-aligned workflows spanning foundational skills, multimodal understanding, and agentic behavior. Seed1.8 is released to support further research and development on interactive, real-world use cases.
Bytedance Seed• 2026
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Understanding | VideoMME | -- | 248 | |
| Streaming Video Understanding | StreamingBench | -- | 158 | |
| Video Understanding | LongVideoBench | -- | 92 | |
| Temporal Video Understanding | TempCompass | Average Score86.9 | 68 | |
| Video Understanding | LVBench | Average Score73 | 67 | |
| Computer Use | OSWorld | -- | 42 | |
| Multimodal Reasoning | MathVista | Pass@187.7 | 36 | |
| Video Understanding | VideoMMMU | -- | 32 | |
| Domain-Specific Applied Tasks | Education, Customer Support Q&A, Information Processing, Intention Recognition, Information Extraction, Complex Workflow | Pass@169 | 30 | |
| Mathematics | AIME-25, HMMT25(Feb), BeyondAIME, AMO-Bench, IMO-AnswerBench | Pass@194.3 | 25 |
Showing 10 of 65 rows