Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Treatment Exploration on UCSF-ALPTDG (test)
Loading...
50.5
Precision
CLARITY
33.028
37.564
42.1
46.636
Dec 8, 2025
Precision
Recall
F1 Score
Updated 3mo ago
Evaluation Results
Method
Method
Links
Precision
Recall
F1 Score
CLARITY
description=Our Approach
2025.12
50.5
47.5
48.9
Claude-4.5-Sonnet
category=General Large...
2025.12
45.3
38.6
41.7
Huatuo-Vision
category=Medical Knowl...
2025.12
42.1
51.5
44.1
MeWM*
category=Medical Knowl...
2025.12
39.3
48.2
43.3
GPT-4o
category=General Large...
2025.12
36.9
44.9
40.5
MedGPT
category=Medical Knowl...
2025.12
36.7
46.3
40.9
Qwen3-VL
category=General Large...
2025.12
33.7
42.9
35.8
Feedback
Search any
task
Search any
task