Share your thoughts, 1 month free Claude Pro on usSee more

Peer Review Feedback Generation on ICLR papers

45.8Combined Success Rate

GPT-5.2

Updated 3mo ago

Evaluation Results

Method	Links
GPT-5.2 2026.04		45.8	1	46.3	1	45.8	1
Gemini-3-flash 2026.04		37.9	0.9	39.4	0.9	37.9	0.9
GOODPOINT-DPO 2026.04		14.7	0.5	14.9	0.5	14.7	0.5
GOODPOINT-SFT 2026.04		9.2	0.5	9.7	0.5	9.2	0.5
Qwen3-8b (Base) 2026.04		8	0.6	8.1	0.6	8	0.6
Llama3.1-8b-Instruct 2026.04		1.8	0.3	1.8	0.3	1.8	0.3