All Datasets

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Retrieval-Augmented Generation | All Datasets Aggregated | Average Performance Score: 76.6 | 55 |
| Exponent beta estimation for power-law entropy growth | All datasets | Beta Exponent: 0.533 | 24 |
| Binary Classification (Assistive vs Creative) | All Datasets Combined | AUC: 99 | 12 |
| Binary Classification (Human vs Creative) | All Datasets Combined | AUC: 0.99 | 12 |
| Binary Classification (Human vs Assistive) | All Datasets Combined | AUC: 98 | 12 |
| Generalized Category Discovery | All Datasets Avg | Overall Accuracy: 75.1 | 12 |
| Critical transition detection | All datasets | AUROC: 85.9 | 9 |
| Ancient inscription restoration | All Datasets Average | SSIM: 93.14 | 9 |
| Ancient Inscription Texture Restoration | All Datasets Averages | PSNR: 37.6393 | 9 |
| Ancient Inscription Restoration | All Datasets Average | LPIPS: 0.0764 | 9 |
| Lesion Segmentation | All Datasets | BBox Score: 0.777 | 6 |
| Image Generation | All Datasets | Fidelity: 54 | 4 |
| Preference Prediction | All Datasets Total | Significant Features Count (S): 43 | 2 |
| Alpha-law validation | All datasets | Clean Accuracy: 31.3 | 1 |
Showing 14 of 14 rows