Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Clinical Diagnosis on RareBench HMS subset n=88
Loading...
51.14
Recall@1
Mixed-Vendor MAC
39.3152
42.3851
45.455
48.5249
Feb 14, 2026
Recall@1
Recall@3
Recall@5
Recall@10
Updated 1mo ago
Evaluation Results
Method
Method
Links
Recall@1
Recall@3
Recall@5
Recall@10
Mixed-Vendor MAC
Setup=Mixed-Vendor MAC
2026.02
51.14
60.23
68.18
77.27
Gemini (Gemini-2.5-Pro)
Setup=Single-LLM
2026.02
47.73
59.09
64.77
70.45
OpenAI (o4-mini)
Setup=Single-Vendor MAC
2026.02
47.73
56.82
62.5
77.27
OpenAI (o4-mini)
Setup=Single-LLM
2026.02
43.18
57.95
68.18
76.14
Gemini (Gemini-2.5-Pro)
Setup=Single-Vendor MAC
2026.02
43.18
51.14
56.82
59.09
Claude (Claude-4.5-Sonnet)
Setup=Single-Vendor MAC
2026.02
43.18
56.82
67.05
72.73
Claude (Claude-4.5-Sonnet)
Setup=Single-LLM
2026.02
39.77
56.82
65.91
75
Feedback
Search any
task
Search any
task