IDEA: An Interpretable and Editable Decision-Making Framework for LLMs via Verbal-to-Numeric Calibration

About

Large Language Models are increasingly deployed for decision-making, yet their adoption in high-stakes domains remains limited by miscalibrated probabilities, unfaithful explanations, and inability to incorporate expert knowledge precisely. We propose IDEA, a framework that extracts LLM decision knowledge into an interpretable parametric model over semantically meaningful factors. Through joint learning of verbal-to-numerical mappings and decision parameters via EM, correlated sampling that preserves factor dependencies, and direct parameter editing with mathematical guarantees, IDEA produces calibrated probabilities while enabling quantitative human-AI collaboration. Experiments across five datasets show IDEA with Qwen-3-32B (78.6%) outperforms DeepSeek R1 (68.1%) and GPT-5.2 (77.9%), achieving perfect factor exclusion and exact calibration -- precision unattainable through prompting alone. The implementation is publicly available at https://github.com/leonbig/IDEA.
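To make the idea of direct parameter editing concrete, here is a minimal sketch, not the authors' implementation. It assumes a logistic decision model over factors whose verbal values (such as "likely") are mapped to numbers. The mapping values, factor names, weights, and the helper functions `decision_probability` and `exclude_factor` are all hypothetical; in IDEA the mapping and parameters are described as being learned jointly via EM, which this sketch does not attempt.

```python
import math

# Hypothetical verbal-to-numeric mapping. In IDEA these values are learned;
# here they are fixed by hand purely for illustration.
VERBAL_TO_NUMERIC = {
    "very unlikely": 0.05,
    "unlikely": 0.25,
    "uncertain": 0.50,
    "likely": 0.75,
    "very likely": 0.95,
}


def decision_probability(verbal_factors, weights, bias):
    """Logistic model over numeric factor values (an assumed model form)."""
    score = bias
    for name, phrase in verbal_factors.items():
        score += weights.get(name, 0.0) * VERBAL_TO_NUMERIC[phrase]
    return 1.0 / (1.0 + math.exp(-score))


def exclude_factor(weights, name):
    """Return a copy of the weights with one factor's weight set to zero."""
    edited = dict(weights)
    edited[name] = 0.0
    return edited


# Hypothetical factors and weights for a credit-style binary decision.
weights = {"stable_income": 2.0, "prior_default": -3.0, "age_group": 0.8}
bias = -0.5
applicant = {
    "stable_income": "likely",
    "prior_default": "unlikely",
    "age_group": "very likely",
}

edited = exclude_factor(weights, "age_group")
p_before = decision_probability(applicant, weights, bias)
p_after = decision_probability(applicant, edited, bias)

# With the weight zeroed, changing the excluded factor cannot move the output.
variant = dict(applicant, age_group="very unlikely")
p_variant = decision_probability(variant, edited, bias)
assert p_after == p_variant
```

Under this assumed model form, excluding a factor is an exact operation on one parameter rather than a request phrased in a prompt, which is the kind of guarantee the abstract contrasts with prompting alone.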

Yanji He, Yuxin Jiang, Yiwen Wu, Bo Huang, Jiaheng Wei, Wei Wang • 2026

Related benchmarks

Task	Dataset	Result
Three-way probability ranking	COMMON2SENSE paired	F1 (C1) 76.2	30
Binary decision	BIGDATA 22	Accuracy 69.3	27
Binary decision	German Credit	Accuracy 68.9	27
Binary decision	COMMON2SENSE	Accuracy 95.1	27
Binary decision	PLASMA	Accuracy 84.5	27
Binary decision	TODAY	Accuracy 75	27

