Tendem: A Hybrid AI+Human Platform

About

Tendem is a hybrid system where AI handles structured, repeatable work and Human Experts step in when the models fail or to verify results. Each result undergoes a comprehensive quality review before delivery to the Client. To assess Tendem's performance, we conducted a series of in-house evaluations on 94 real-world tasks, comparing it with AI-only agents and human-only workflows carried out by Upwork freelancers. The results show that Tendem consistently delivers higher-quality outputs with faster turnaround times. At the same time, its operational costs remain comparable to human-only execution. On third-party agentic benchmarks, Tendem's AI Agent (operating autonomously, without human involvement) performs near state-of-the-art on web browsing and tool-use tasks while demonstrating strong results in frontier domain knowledge and reasoning.

Konstantin Chernyshev, Ekaterina Artemova, Viacheslav Zhukov, Maksim Nerush, Mariia Fedorova, Iryna Repik, Olga Shapovalova, Aleksey Sukhorosov, Vladimir Dobrovolskii, Natalia Mikhailova, Sergei Tilga• 2026

Related benchmarks

Task	Dataset	Result
Multi-domain Knowledge and Reasoning	HLE (Humanity’s Last Exam) (official)	Exact Match39	7
Assistant Tasks	GAIA (official)	Exact Match78.2	6
Web Browsing	BrowseComp (official)	Exact Match71	5
Task Completion	Internal Task Benchmark	Avg Connection Time (hours)4.8	3

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord