Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

NB-Curated

Benchmarks

Task NameDataset NameSOTA ResultTrend
Model RoutingNB-Curated OOD
Uniqueness35.4
11
Reward ScoringNB-curated (evaluation set)
Mean Reward Score33.64
6
Showing 2 of 2 rows