Share your thoughts, 1 month free Claude Pro on usSee more

Question Answering Feedback Generation on QA-FEEDBACK (test)

0.513Rs1 (Relevance)

SFT

Updated 4mo ago

Evaluation Results

Method	Links
SFT 2023.06		0.513	0.749	-0.053	48.96
FINE-GRAINED RLHF 2023.06		0.513	0.816	0.139	49.93
SFT-Full 2023.06		0.508	0.756	0.044	49.63
Pref. RLHF 2023.06		0.482	0.781	0.101	49.84