Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ICLR

Benchmarks

Task NameDataset NameSOTA ResultTrend
Node RetrievalICLR 2025 (500 papers)
Recall @ 90.172
16
Paper Acceptance DecisionICLR 2025 (test)
Accuracy71.92
15
Paper Quality EvaluationICLR 2025 (test)
Jaccard Index37.98
15
Multi-turn role-playICLR
Success Rate (SR)96.2
12
Review Score GenerationICLR 2025
Average Review Score6.4
10
Scientific Review Feedback GenerationICLR LLM-as-a-Judge 2025 (test)
Actionability Score3.38
9
Scientific Review Feedback GenerationICLR Human Evaluation 2025 (test)
Actionability3.46
9
Holistic Technical Quality EvaluationICLR 2025
Originality3.35
8
Empathetic DialogueICLR
Success Rate (SR)96.7
5
Citation Coverage EvaluationICLR 2025
Avg Cites45.73
3
Coverage-based AlignmentICLR 50 submissions 2026
Str-Cov88.6
3
Score-based AlignmentICLR 2026 (50 submissions)
R-MSE0.148
3
Showing 12 of 12 rows