Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

IDEA: An Interpretable and Editable Decision-Making Framework for LLMs via Verbal-to-Numeric Calibration

About

Large Language Models are increasingly deployed for decision-making, yet their adoption in high-stakes domains remains limited by miscalibrated probabilities, unfaithful explanations, and inability to incorporate expert knowledge precisely. We propose IDEA, a framework that extracts LLM decision knowledge into an interpretable parametric model over semantically meaningful factors. Through joint learning of verbal-to-numerical mappings and decision parameters via EM, correlated sampling that preserves factor dependencies, and direct parameter editing with mathematical guarantees, IDEA produces calibrated probabilities while enabling quantitative human-AI collaboration. Experiments across five datasets show IDEA with Qwen-3-32B (78.6%) outperforms DeepSeek R1 (68.1%) and GPT-5.2 (77.9%), achieving perfect factor exclusion and exact calibration -- precision unattainable through prompting alone. The implementation is publicly available at https://github.com/leonbig/IDEA.

Yanji He, Yuxin Jiang, Yiwen Wu, Bo Huang, Jiaheng Wei, Wei Wang• 2026

Related benchmarks

TaskDatasetResultRank
Three-way probability rankingCOMMON2SENSE paired
F1 (C1)76.2
30
Binary decisionBIGDATA 22
Accuracy69.3
27
Binary decisionGerman Credit
Accuracy68.9
27
Binary decisionCOMMON2SENSE
Accuracy95.1
27
Binary decisionPLASMA
Accuracy84.5
27
Binary decisionTODAY
Accuracy75
27
Showing 6 of 6 rows

Other info

Follow for update