Compress the Context, Keep the Commitments: A Formal Framework for Verifiable LLM Context Compression

About

LLM context is not just tokens; it is a set of commitments. Long-running conversations accumulate goals, constraints, decisions, preferences, tool results, retrieved evidence, artifacts, and safety boundaries that future responses must preserve. Existing context-management methods reduce length through truncation, retrieval, summarization, memory systems, or token-level prompt compression, but they rarely specify which semantic commitments must survive compression or how their preservation should be measured. We propose Context Codec, a commitment-level framework for compressing prompts and chat histories. Context Codec represents dialogue state as typed, source-grounded semantic atoms with canonical identity, equivalence, conflict, confidence, risk, and evidence spans. It separates five concerns (extraction, normalization, representation, rendering, and verification) and introduces metrics for Critical Atom Recall, Weighted Atom Recall, Commitment Density, and round-trip recoverability. It also defines a taxonomy of semantic compression errors, a concrete normalization procedure, conservative fallback rules for low-confidence and safety-critical atoms, and Context Compression Language (CCL), an ASCII-first compact rendering of canonical JSON atoms. In a small diagnostic study, CCL-Core occupies a useful middle ground between structured prose and JSON: more explicit and auditable than prose, usually more compact than JSON, and less risky than heavily minified notation. The result is not a claim that shorthand solves compression, but a framework for making context compression verifiable: compress the conversation, keep the commitments.
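To make the atom-level metrics named above more concrete, here is a minimal Python sketch. The abstract does not give the formulas, so everything in it is an assumption made for illustration: the `Atom` fields, the reading of Critical Atom Recall as the fraction of critical atoms that survive, Weighted Atom Recall as the surviving share of total weight, and Commitment Density as preserved atoms per compressed token. The paper's actual definitions may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Atom:
    """Hypothetical semantic atom; field names are illustrative, not the paper's schema."""
    atom_id: str            # canonical identity
    kind: str               # e.g. "goal", "constraint", "decision"
    text: str
    weight: float = 1.0     # assumed importance weight
    critical: bool = False  # assumed flag for must-survive atoms


def critical_atom_recall(source, kept_ids):
    """Assumed definition: fraction of critical source atoms found in the compressed form."""
    critical = [a for a in source if a.critical]
    if not critical:
        return 1.0  # nothing critical to lose
    return sum(a.atom_id in kept_ids for a in critical) / len(critical)


def weighted_atom_recall(source, kept_ids):
    """Assumed definition: surviving weight divided by total source weight."""
    total = sum(a.weight for a in source)
    if total == 0:
        return 1.0
    return sum(a.weight for a in source if a.atom_id in kept_ids) / total


def commitment_density(source, kept_ids, compressed_tokens):
    """Assumed definition: preserved atoms per token of compressed context."""
    if compressed_tokens <= 0:
        raise ValueError("compressed_tokens must be positive")
    return sum(a.atom_id in kept_ids for a in source) / compressed_tokens


# Made-up example data, not taken from the paper's diagnostic study.
source_atoms = [
    Atom("a1", "constraint", "budget must stay under 2000 USD", weight=3.0, critical=True),
    Atom("a2", "goal", "plan a five-day trip", weight=2.0),
    Atom("a3", "safety", "traveler has a nut allergy", weight=1.0, critical=True),
    Atom("a4", "preference", "prefers trains over flights", weight=2.0),
]
kept = {"a1", "a2", "a4"}  # atom ids recovered from the compressed context

car = critical_atom_recall(source_atoms, kept)
war = weighted_atom_recall(source_atoms, kept)
density = commitment_density(source_atoms, kept, compressed_tokens=12)
print(car, war, density)
```

In this made-up case the compressed context keeps most of the weight but drops one of the two critical atoms, which is the kind of failure a critical-atom metric would surface separately from an aggregate score.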

Natalia Trukhina, Vadim Vashkelis • 2026

Related benchmarks

Task                 Dataset                       Result                             Rank
Context Compression  Diagnostic Cases Epidemic     Compression Ratio: 113             5
Context Compression  Diagnostic Cases (Trip)       Compression Score: 76              5
Prompt Compression   Diagnostic Cases Aggregate    Average Compression Tokens: 71.2   5
Context Compression  Diagnostic Cases React        Compression Ratio: 52              5
Context Compression  Diagnostic Cases Python       Comp.: 57                          5
Context Compression  Diagnostic Cases (Research)   Compression Ratio: 58              5