Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DeepReviewer Evaluation Suite

Benchmarks

Task NameDataset NameSOTA ResultTrend
Autonomous research paper generationDeepReviewer Evaluation Suite (NeurIPS 2023, IJCV 2025, and ICLR 2025 baseline papers)
Review Score5.75
4
Showing 1 of 1 rows