
Binding Language Models in Symbolic Languages

About

Though end-to-end neural approaches have recently come to dominate NLP tasks in both performance and ease of use, they lack interpretability and robustness. We propose Binder, a training-free neural-symbolic framework that maps the task input to a program, which (1) allows binding a unified API of language model (LM) functionalities to a programming language (e.g., SQL, Python) to extend its grammar coverage and thus tackle more diverse questions, (2) adopts an LM as both the program parser and the underlying model called by the API during execution, and (3) requires only a few in-context exemplar annotations. Specifically, we employ GPT-3 Codex as the LM. In the parsing stage, with only a few in-context exemplars, Codex is able to identify the part of the task input that cannot be answered by the original programming language, correctly generate API calls to prompt Codex to solve that part, and decide where to place the API calls while remaining compatible with the original grammar. In the execution stage, Codex can perform versatile functionalities (e.g., commonsense QA, information extraction) given proper prompts in the API calls. Binder achieves state-of-the-art results on the WikiTableQuestions and TabFact datasets, with explicit output programs that aid human debugging. Note that previous best systems are all fine-tuned on tens of thousands of task-specific samples, while Binder uses only dozens of annotations as in-context exemplars without any training. Our code is available at https://github.com/HKUNLP/Binder.

Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu • 2022
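To make the two stages above concrete, here is a minimal, self-contained Python sketch of the parse-then-execute loop: a stub LM first "parses" a question into SQL extended with a QA(question; column) call, and the executor then resolves that call by prompting the same stub per cell value before running plain SQL. Everything here (the llm() stub, the regex-based binding, the hard-coded table) is an illustrative assumption rather than the repository's actual API; see https://github.com/HKUNLP/Binder for the real implementation.

```python
# Illustrative sketch of a Binder-style pipeline (parse, then execute).
# llm(), the QA("question"; column) surface form, and all helper names
# are hypothetical stand-ins, not the repository's actual API.
import re
import sqlite3

def llm(prompt: str) -> str:
    """Stand-in for a Codex call; a real system would query the LM here."""
    if "Generate SQL" in prompt:
        # Parsing stage: the LM emits SQL extended with a QA() API call
        # for the part plain SQL cannot answer.
        return ("SELECT name FROM shirts WHERE "
                "QA(\"is it made in the US?\"; made_in) = 'yes'")
    # Execution stage: the LM answers the sub-question for one cell value.
    return "yes" if "USA" in prompt else "no"

def execute(program: str, conn: sqlite3.Connection) -> list:
    """Resolve a QA(question; column) call by prompting the LM once per
    distinct cell value, bind the answers back into the query as a CASE
    expression, and run the resulting plain SQL."""
    match = re.search(r'QA\("(.+?)";\s*(\w+)\)', program)
    if match:
        question, column = match.groups()
        rows = conn.execute(f"SELECT DISTINCT {column} FROM shirts").fetchall()
        cases = " ".join(
            f"WHEN {column} = '{v}' THEN '{llm(f'{question} value: {v}')}'"
            for (v,) in rows
        )
        program = program.replace(match.group(0), f"(CASE {cases} END)")
    return conn.execute(program).fetchall()

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE shirts (name TEXT, made_in TEXT)")
conn.executemany("INSERT INTO shirts VALUES (?, ?)",
                 [("Tee A", "USA"), ("Tee B", "Vietnam")])
program = llm("Generate SQL for: which shirts are made in the US?")
print(execute(program, conn))  # -> [('Tee A',)]
```

Run as-is, this prints [('Tee A',)]. The real framework is of course far more general in how it parses and executes programs, but the control flow matches the description above: the same LM both writes the program and answers the sub-questions its API calls pose.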

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Table Question Answering | WTQ | Accuracy | 64.6 | 101 |
| Table Question Answering | WikiTQ (test) | Accuracy | 64.6 | 92 |
| Table Question Answering | WikiTableQuestions (test) | Accuracy | 61.9 | 86 |
| Fact Verification | TabFact | Accuracy | 85.1 | 73 |
| Table Question Answering | WikiTQ | Accuracy | 56.74 | 65 |
| Table Fact Verification | TabFact small (test) | Accuracy | 0.851 | 57 |
| Table Question Answering | NQ-Table | F1 Score | 67.44 | 50 |
| Table Question Answering | WikiTQ | F1 Score | 61.68 | 50 |
| Table Question Answering | HiTab | F1 Score | 59.03 | 50 |
| Table Question Answering | SequentialQA | F1 Score | 22.49 | 50 |

Showing 10 of 37 rows.

Other info

Code: https://github.com/HKUNLP/Binder
