Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-turn clinical response generation on MAQuE (test)
Loading...
61.5
Accuracy
SEMA-RAG
52.14
54.57
57
59.43
May 16, 2026
Accuracy
Robustness
Relevance
Empathy
Updated 15d ago
Evaluation Results
Method
Method
Links
Accuracy
Robustness
Relevance
Empathy
SEMA-RAG
Backbone=deepseek-v3.1
2026.05
61.5
75.38
82
72
MedRAG
Backbone=deepseek-v3.1
2026.05
52.5
64.86
74
66.4
Feedback
Search any
task
Search any
task