
Seed1.8 Model Card: Towards Generalized Real-World Agency

About

We present Seed1.8, a foundation model aimed at generalized real-world agency: going beyond single-turn prediction to multi-turn interaction, tool use, and multi-step execution. Seed1.8 keeps strong LLM and vision-language performance while supporting a unified agentic interface: search, code generation and execution, and GUI interaction. For deployment, it offers latency- and cost-aware inference, including configurable thinking modes and optimized visual encoding for images and video. We report evaluations on standard benchmarks and application-aligned workflows spanning foundational skills, multimodal understanding, and agentic behavior. Seed1.8 is released to support further research and development on interactive, real-world use cases.

ByteDance Seed • 2026

Related benchmarks

Task                           Dataset             Result                  Rank
Video Understanding            VideoMME            --                      357
Streaming Video Understanding  StreamingBench      --                      259
Temporal Video Understanding   TempCompass         --                      141
Video Understanding            LongVideoBench      --                      123
Video Understanding            MMVU                --                      76
Video Understanding            LVBench             Average Score: 73       75
Video Understanding            VideoMMMU           --                      59
Computer Use                   OSWorld             --                      45
Multimodal Reasoning           MathVista           Pass@1: 87.7            36
GUI Navigation                 OSWorld (Verified)  OS Success Rate: 66.67  32
Showing 10 of 74 rows
