Medical Logical Reasoning (Rule 3: is_mel_on_back) on HAM 5000 samples (test)
[Chart: F1 (NS-CL) and F1 (Decision Tree) by method. Highlighted point: GroundTruth (Oracle), F1 (NS-CL) 99.8, May 5, 2026]
Updated 22d ago
Evaluation Results
| Method               | Supervision (%) | Date    | F1 (NS-CL) | F1 (Decision Tree) |
|----------------------|-----------------|---------|------------|--------------------|
| GroundTruth (Oracle) | Oracle/GT       | 2026.05 | 99.8       | 100                |
| GlobalVAE            | 75%             | 2026.05 | 90.8       | 88.4               |
| SlotVAE (k=1)        | 75%             | 2026.05 | 60.6       | 40                 |
| DINOv2               | 75%             | 2026.05 | 0.8        | 18.4               |