Compress the Context, Keep the Commitments: A Formal Framework for Verifiable LLM Context Compression

About

LLM context is not just tokens; it is a set of commitments. Long-running conversations accumulate goals, constraints, decisions, preferences, tool results, retrieved evidence, artifacts, and safety boundaries that future responses must preserve. Existing context-management methods reduce length through truncation, retrieval, summarization, memory systems, or token-level prompt compression, but they rarely specify which semantic commitments must survive compression or how their preservation should be measured. We propose Context Codec, a commitment-level framework for compressing prompts and chat histories. Context Codec represents dialogue state as typed, source-grounded semantic atoms with canonical identity, equivalence, conflict, confidence, risk, and evidence spans. It separates five concerns - extraction, normalization, representation, rendering, and verification - and introduces metrics for Critical Atom Recall, Weighted Atom Recall, Commitment Density, and round-trip recoverability. It also defines a taxonomy of semantic compression errors, a concrete normalization procedure, conservative fallback rules for low-confidence and safety-critical atoms, and Context Compression Language (CCL), an ASCII-first compact rendering of canonical JSON atoms. In a small diagnostic study, CCL-Core occupies a useful middle ground between structured prose and JSON: more explicit and auditable than prose, usually more compact than JSON, and less risky than heavily minified notation. The result is not a claim that shorthand solves compression, but a framework for making context compression verifiable: compress the conversation, keep the commitments.

Natalia Trukhina, Vadim Vashkelis• 2026

Related benchmarks

Task	Dataset	Result
Context Compression	Diagnostic Cases Epidemic	Compression Ratio113	5
Context Compression	Diagnostic Cases (Trip)	Compression Score76	5
Prompt Compression	Diagnostic Cases Aggregate	Average Compression Tokens71.2	5
Context Compression	Diagnostic Cases React	Compression Ratio52	5
Context Compression	Diagnostic Cases Python	Comp.57	5
Context Compression	Diagnostic Cases (Research)	Compression Ratio58	5

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord