Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection

About

Zero-shot methods detect LLM-generated text by computing statistical signatures using a surrogate model. Existing approaches typically employ a fixed surrogate for all inputs regardless of the unknown source. We systematically examine this design and find that detection performance varies substantially depending on surrogate-source alignment. We observe that while no single surrogate achieves optimal performance universally, a well-matched surrogate typically exists within a diverse pool for any given input. This finding transforms robust detection into a routing problem: selecting the most appropriate surrogate for each input. We propose DetectRouter, a prototype-based framework that learns text-detector affinity through two-stage training. The first stage constructs discriminative prototypes from white-box models; the second generalizes to black-box sources by aligning geometric distances with observed detection scores. Experiments on EvoBench and MAGE benchmarks demonstrate consistent improvements across multiple detection criteria and model families.

Ke Sun, Guangsheng Bao, Han Cui, Yue Zhang• 2026

Related benchmarks

TaskDatasetResultRank
LLM-generated text detectionEvoBench
LLaMA3 Score95.66
26
Machine-generated text detectionMAGE--
18
Showing 2 of 2 rows

Other info

Follow for update