Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agents

About

The token-level extractive compressors widely used for general LM context are structurally inappropriate for LLM agents: across 17 (env, backbone, method) cells spanning two independent token-level method families, every cell collapses to mean reward <= 0.05 despite 1.3-13.3x realized compression. We name and characterize this failure mode as action-grammar destruction -- the tokens carrying action semantics (identifiers, brackets, action verbs) are exactly those self-information ranks lowest, so a general-purpose compressor reliably removes them and the environment rejects the residual. The diagnosis points to step-granularity compression. We introduce AGORA, an inference-free step-level compressor combining a structural prompt parser, an always-keep floor for format- and recency-critical content, and a 125M-parameter relevance scorer trained on counterfactual next-action-change labels (~2ms/step, zero per-step LLM toll). Across the compared inference-free and LLM-based methods, AGORA is the only one retaining >= 75% uncompressed performance in 8 of 9 cells (with the lone exception at 73%); a four-way component ablation isolates the structural floor as the dominant quality lever and the learned scorer as the source of 1.0-11.5x adaptive end-to-end compression from a single fixed keep ratio.

Haoran Zhang, Zhaohua Sun• 2026

Related benchmarks

TaskDatasetResultRank
Mean RewardAlfWorld
Mean Reward0.767
30
Mean RewardScienceWorld
Mean Reward0.217
30
Mean RewardWebshop
Mean Reward46.8
30
Interactive Agent TaskAlfWorld
Effective Steps Multiplier1
15
Interactive Agent TaskScienceWorld
Efficiency Factor11.5
15
Interactive Agent TaskWebshop
Efficiency Multiplier8.9
15
Showing 6 of 6 rows

Other info

Follow for update