Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rigorous Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Deep research report generationRigorous Bench
Quality0.6257
22
Showing 1 of 1 rows