Seed1.8 Model Card: Towards Generalized Real-World Agency

About

We present Seed1.8, a foundation model aimed at generalized real-world agency: going beyond single-turn prediction to multi-turn interaction, tool use, and multi-step execution. Seed1.8 keeps strong LLM and vision-language performance while supporting a unified agentic interface that covers search, code generation and execution, and GUI interaction. For deployment, it offers latency- and cost-aware inference, including configurable thinking modes and optimized visual encoding for images and video. We report evaluations on standard benchmarks and application-aligned workflows spanning foundational skills, multimodal understanding, and agentic behavior. Seed1.8 is released to support further research and development on interactive, real-world use cases.

Bytedance Seed • 2026

Related benchmarks

Task	Dataset	Result
Video Understanding	VideoMME	--	369
Streaming Video Understanding	StreamingBench	--	308
Temporal Video Understanding	TempCompass	--	160
Video Understanding	LongVideoBench	--	128
Video Understanding	LVBench	--	95
Video Understanding	MMVU	--	91
Video Understanding	VideoMMMU	--	67
GUI Navigation	AndroidWorld latest (test)	Success Rate: 70.7	48
Computer Use	OSWorld	--	45
GUI Agent Task Completion	OSWorld 1.0 (test)	Success Rate (OS): 16	42

Showing 10 of 92 rows

...
