
NIH

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Lung segmentation | NIH | Recall 97.4 | 16 |
| Chest X-ray classification | NIH (test) | AUROC (Micro) 89.3 | 14 |
| Pancreas Segmentation | NIH | DSC 0.8947 | 8 |
| Out-of-Distribution Detection | NIH | AUF 91.5 | 8 |
| Chest X-ray Classification | NIH manually re-labelled clean (test) | Pneumothorax AUC 89.1 | 8 |
| Anomaly Detection | NIH clearer (test) | AUC 94.6 | 7 |
| Long-context retrieval | NIH | Multi-needle Avg Recall 100 | 6 |
| Medical Image Classification | NIH (100% labeled) | AUC 78.7 | 6 |
| Medical Image Classification | NIH 10% labeled | AUC 71.6 | 6 |
| Medical Image Classification | NIH (1% labeled) | AUC 0.622 | 6 |
| Anomaly Detection | NIH AP projection (test) | AUC 60.1 | 6 |
| Anomaly Detection | NIH PA projection (test) | AUC 0.708 | 6 |
| Long context understanding | NIH Multi-needle | Accuracy 100 | 5 |
| Image Classification | NIH | Accuracy 56.4 | 5 |
| Out-of-Distribution Detection | NIH ID (Xray) vs OOD (Xray) | AUROC 0.54 | 3 |
| Binary Classification | NIH 10-fold cross-validation local model | Mean F1 94 | 2 |