Training Language Models to Generate Text with Citations via Fine-grained Rewards

About

While recent Large Language Models (LLMs) have proven useful in answering user queries, they are prone to hallucination, and their responses often lack credibility because they do not reference reliable sources. An intuitive solution is to include in-text citations that point to external documents as evidence. Previous works have directly prompted LLMs to generate in-text citations, but their performance is far from satisfactory, especially for smaller LLMs. In this work, we propose an effective training framework that uses fine-grained rewards to teach LLMs to generate highly supportive and relevant citations while ensuring the correctness of their responses. We also conduct a systematic analysis of applying these fine-grained rewards to common LLM training strategies, demonstrating their advantage over conventional practices. We conduct extensive experiments on Question Answering (QA) datasets taken from the ALCE benchmark and validate the model's generalizability using EXPERTQA. On LLaMA-2-7B, incorporating fine-grained rewards outperforms the baselines, even surpassing GPT-3.5-turbo.

Chengyu Huang, Zeqiu Wu, Yushi Hu, Wenya Wang • 2024

Related benchmarks

Task	Dataset	Result
Question Answering	ASQA (test)	Correctness EM Recall: 77.83	50
Citation-augmented Question Answering	bar-GT, PK 1.0 (test)	Accuracy: 65.24	42
Citation-augmented Question Answering	GT, PK 1.0 (test)	Accuracy: 76.65	21
Attributed Text Generation	ASQA	Correctness (EM Rec.): 40.1	19
Attributed Text Generation	QAMPARI	Correctness Recall-5: 16.7	19
Attributed Text Generation	ELI5	Claim Correctness Score: 11.5	19
Question Answering	QAMPARI (test)	Correctness Rec@5: 18.86	17
Attributable Text Generation	ExpertQA v1 (test)	AutoAIS: 0.6612	9
Question Answering	EXPERTQA (test)	Claim Recall: 15.53	6
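
Several rows above report an exact-match (EM) recall for correctness. As a rough illustration only, the sketch below computes one common form of that metric: the percentage of gold answer groups for which at least one alias appears in the normalized response. The function names, the normalization rules, and the input layout are assumptions made for this example, not the benchmark's official scoring code.

```python
import re
import string


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def em_recall(response: str, answer_groups: list[list[str]]) -> float:
    """Percentage of answer groups with at least one alias found in the response.

    `answer_groups` is a hypothetical layout: one inner list of acceptable
    aliases per gold answer.
    """
    if not answer_groups:
        return 0.0
    norm_response = normalize(response)
    hits = sum(
        any(normalize(alias) in norm_response for alias in group)
        for group in answer_groups
    )
    return 100.0 * hits / len(answer_groups)


if __name__ == "__main__":
    gold = [["1994"], ["USA", "United States"], ["Canada"]]
    answer = "The film was released in 1994 in the United States."
    print(round(em_recall(answer, gold), 2))
```

Substring matching of this kind rewards a response for mentioning a gold answer anywhere in its text, which is why such scores are usually read alongside citation-quality metrics rather than on their own.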

