Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

In-house

Benchmarks

Task NameDataset NameSOTA ResultTrend
ClassificationIn-house
Accuracy92.27
26
Mean Deviation predictionIn-house (test)
R2 Score0.229
22
Speech RecognitionIn-House dataset
CER0.0305
19
Multimodal ResearchIn-house 2
Accuracy37.7
18
Multimodal ResearchIn-house 1
Accuracy52.5
18
Medical VQAIn-house (Held-out)
Accuracy80.5
16
SegmentationHeld-out in-house (test)
IOU72.09
13
Polyp DetectionIn-house 1.0 (test)
Precision94.2
12
Clinical text revisionIn house Radiology Report
Mistral Score0.79
11
PET reconstructionIn-House (1% Count)
SSIM0.974
10
PET reconstructionIn-House 10% Count real (test)
SSIM0.9852
10
Abnormal classificationIn-house (test)
AUC88.4
10
Asymmetric classificationIn-house (test)
AUC0.907
10
Text-to-Image Generationin-house (test)
Overall HP Score5.39
9
Colorectal polyp classificationIn-house WLI (test)
Accuracy0.802
8
Thyroid Ultrasound SegmentationIn-house
DSC82.59
7
Lung nodule malignancy predictionIn-house (test)
AUC (<10mm)0.899
7
Contrast-enhanced breast MRI synthesisIn-house
SSIM90.9
6
Semantic SegmentationIn-house
Accuracy93.7
6
Explanation quality evaluationIn-house Dataset
Helpfulness80.8
6
Dynamic Reconstructionin-house dataset
SSIMa66.81
6
Staging & DiagnosisIn-house n=640 (test)
BAcc84.3
6
Automatic Speech RecognitionIn-house EN music (test)
WER24.61
5
Automatic Speech RecognitionIn-house EN speech (test)
WER9.42
5
Automatic Speech RecognitionIn-house ZH domain H (test)
WER3.59
5
Showing 25 of 33 rows