Share your thoughts, 1 month free Claude Pro on usSee more

Conversational response generation on MD2Dial

31.2F1 Score

ChatR1-7b

Updated 2mo ago

Evaluation Results

Method	Links
ChatR1-7b 2025.10		31.2	84.5
ChatR1 (w/o Rint.) 2025.10		26.4	77.4
ChatR1-3b 2025.10		26	83.1
SFT 2025.10		25.4	84.2
QR Search R1 2025.10		23.1	82.1
ChatGPT (DI) 2025.10		21.6	81.7
Qwen-Instr. (RAG) 2025.10		18.8	75.1
CoT R1 2025.10		18	80.2
IRCoT 2025.10		13.3	67.5
Qwen-Instr. (DI) 2025.10		13.2	64.4
Qwen-Instr. (CoT) 2025.10		10.5	63.6