
S3

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Clinical diagnosis | S3 1.0 (test) | Precision: 98 | 36 |
| Cluster count selection | S3 | Selected Cluster Count: 10 | 21 |
| Clustering | S3 N(200) | ACC: 88.1 | 20 |
| Synthetic Function Optimization | S3 Perm. Rosen. | Median LogGap: 1.9114 | 14 |
| Narrative report generation | S3 1.0 (test) | RQI Score: 39.8 | 12 |
| Heterogeneous Treatment Effect Estimation | S3 zeta=3, no overlap Synthetic (test) | RMSE (L=10): 0.225 | 9 |
| Nuclear Segmentation | S3 | Coverage: 0.9944 | 6 |
| Human Novel-view Rendering | S3 4K | PSNR: 30.0311 | 6 |
| Human Novel-view Rendering | S3 1K | PSNR: 33.196 | 6 |
| Point-level consensus correctness prediction | S3 | AUPRC: 93.2 | 4 |
| Task performance and governance | S3 Threshold | CDL: 46.5 | 2 |
| Clustering | S3 | SmM: 0.6703 | 1 |