Tendem: A Hybrid AI+Human Platform
About
Tendem is a hybrid system in which AI handles structured, repeatable work and Human Experts step in when the models fail or when results need verification. Each result undergoes a comprehensive quality review before delivery to the Client. To assess Tendem's performance, we conducted a series of in-house evaluations on 94 real-world tasks, comparing it with AI-only agents and with human-only workflows carried out by Upwork freelancers. The results show that Tendem consistently delivers higher-quality outputs with faster turnaround times, while its operational costs remain comparable to human-only execution. On third-party agentic benchmarks, Tendem's AI Agent (operating autonomously, without human involvement) performs near state-of-the-art on web browsing and tool-use tasks while demonstrating strong results in frontier domain knowledge and reasoning.
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Multi-domain Knowledge and Reasoning | HLE (Humanity’s Last Exam) (official) | Exact Match | 39 | 7 |
| Assistant Tasks | GAIA (official) | Exact Match | 78.2 | 6 |
| Web Browsing | BrowseComp (official) | Exact Match | 71 | 5 |
| Task Completion | Internal Task Benchmark | Avg Connection Time (hours) | 4.8 | 3 |