Review Score Generation on NeurIPS 2025
[Chart: Avg Review Score. PAA: 4.8 (Jan 11, 2026). Updated 1mo ago.]
Evaluation Results
Method      Variant                      Date      Avg Review Score
PAA         Attacking Model=Gemini...    2026.01   4.8
PAA         Attacking Model=Sonnet...    2026.01   4.7
PAA         Attacking Model=Sonnet...    2026.01   4.5
PAA         Attacking Model=GPT-4o...    2026.01   4.4
PAA         Attacking Model=Gemini...    2026.01   4.3
PAA         Attacking Model=GPT-4o...    2026.01   4.1
PAA         Attacking Model=OLMo 3...    2026.01   3.3
PAA         Attacking Model=Qwen 3...    2026.01   2.9
Paraphrase  -                            2026.01   2.5
Original    -                            2026.01   2.3