Medical Diagnosis on DDXPlus original (test)
Chart (Similarity Score): MedExAgent-8B, 0.953, May 8, 2026
Metrics: Similarity Score, Jaccard Index, Accuracy
Updated 23d ago
Evaluation Results

Method                 Model category  Date     Similarity Score  Jaccard Index  Accuracy (%)
MedExAgent-8B          Medical...      2026.05  0.953             0.937          93.9
Aloe-Beta-70B          Medical...      2026.05  0.716             0.503          67
Qwen3-32B              General...      2026.05  0.685             0.466          62.6
HuatuoGPT-o1-70B       Medical...      2026.05  0.664             0.41           47
Qwen3-14B              General...      2026.05  0.659             0.399          53.8
Llama-3.1-8B-Instruct  General...      2026.05  0.64              0.383          60.5
Qwen3-8B               General...      2026.05  0.605             0.328          46
MedGemma-27B-text-it   Medical...      2026.05  0.498             0.241          32.5