Adaptive Constraint Propagation: Scaling Structured Inference for Large Language Models via Meta-Reinforcement Learning
About
Large language models increasingly require structured inference, from JSON schema enforcement to multi-lingual parsing, where outputs must satisfy complex constraints. We introduce MetaJuLS, a meta-reinforcement learning approach that learns universal constraint propagation policies applicable across languages and tasks without task-specific retraining. By formulating structured inference as adaptive constraint propagation and training a Graph Attention Network with meta-learning, MetaJuLS achieves 1.5--2.0$\times$ speedups over GPU-optimized baselines while maintaining within 0.2\% accuracy of state-of-the-art parsers. On Universal Dependencies across 10 languages and LLM-constrained generation (LogicBench, GSM8K-Constrained), MetaJuLS demonstrates rapid cross-domain adaptation: a policy trained on English parsing adapts to new languages and tasks with 5--10 gradient steps (5--15 seconds) rather than requiring hours of task-specific training. Mechanistic analysis reveals the policy discovers human-like parsing strategies (easy-first) and novel non-intuitive heuristics. By reducing propagation steps in LLM deployments, MetaJuLS contributes to Green AI by directly reducing inference carbon footprint.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Constituency Parsing | Penn Treebank WSJ (section 23 test) | F1 Score95.6 | 55 | |
| Constrained Decoding | LogicBench | Constraint Satisfaction98.5 | 7 | |
| Dependency Parsing | Universal Dependencies (UD) English v2.11 | LAS94.9 | 6 | |
| Constraint Programming | MiniZinc Challenge 2022-2024 | Avg. Gap0.066 | 4 | |
| Constrained Decoding | GSM8K Constrained | Constraint Satisfaction96.8 | 3 | |
| Dependency Parsing | Universal Dependencies (UD) Spanish v2.11 | LAS (%)94.8 | 1 | |
| Dependency Parsing | Universal Dependencies (UD) Chinese v2.11 | LAS93.2 | 1 | |
| Dependency Parsing | Universal Dependencies (UD) Arabic v2.11 | LAS92.1 | 1 | |
| Dependency Parsing | Universal Dependencies (UD) Finnish v2.11 | LAS (%)91.8 | 1 |