Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Seed1.8 Model Card: Towards Generalized Real-World Agency

About

We present Seed1.8, a foundation model aimed at generalized real-world agency: going beyond single-turn prediction to multi-turn interaction, tool use, and multi-step execution. Seed1.8 keeps strong LLM and vision-language performance while supporting a unified agentic interface-search, code generation and execution, and GUI interaction. For deployment, it offers latency- and cost-aware inference, including configurable thinking modes and optimized visual encoding for images and video. We report evaluations on standard benchmarks and application-aligned workflows spanning foundational skills, multimodal understanding, and agentic behavior. Seed1.8 is released to support further research and development on interactive, real-world use cases.

Bytedance Seed• 2026

Related benchmarks

TaskDatasetResultRank
Video UnderstandingVideoMME--
248
Streaming Video UnderstandingStreamingBench--
158
Video UnderstandingLongVideoBench--
92
Temporal Video UnderstandingTempCompass
Average Score86.9
68
Video UnderstandingLVBench
Average Score73
67
Computer UseOSWorld--
42
Multimodal ReasoningMathVista
Pass@187.7
36
Video UnderstandingVideoMMMU--
32
Domain-Specific Applied TasksEducation, Customer Support Q&A, Information Processing, Intention Recognition, Information Extraction, Complex Workflow
Pass@169
30
MathematicsAIME-25, HMMT25(Feb), BeyondAIME, AMO-Bench, IMO-AnswerBench
Pass@194.3
25
Showing 10 of 65 rows

Other info

Follow for update